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DESCRIPTION 

DNA SEQUENCES FROM STAPHYLOCOCCUS AUREUS 
BACTERIOPHAGE 77 THAT ENCODE ANTI-MICROBIAL 
5 POLYPEPTIDES 

RELATED APPLICATIONS 

This application claims the benefit of United States Provisional Application 
10 No. 60/1 10,992, filed December 3, 1998, which is hereby incorporated by reference 
in its entirety, including drawings. 

BACKGROUND OF THE INVENTION 



4» 1 5 This invention relates to the identification of antimicrobial agents and of 

v t microbial targets of such agents, and in particular to the isolation of bacteriophage 

ftJ DNA sequences, and their translated protein products, showing anti-microbial 

f ;j 

7;= activity. The DNA sequences can be expressed in expression vectors. These 

expression constructs and the proteins produced therefrom can be used for a variety 
20 of purposes including therapeutic methods and identification of microbial targets. 

The following description is provided to assist the understanding of the 
reader. None of the information provided or references cited is admitted to be prior 
art to the present invention. 

The frequency and spectrum of antibiotic-resistant infections have, in recent 
25 years, increased in both the hospital and community. Certain infections have 
become essentially untreatable and are growing to epidemic proportions in the 
developing world as well as in institutional settings in the developed world. The 
staggering spread of antibiotic resistance in pathogenic bacteria has been attributed 
to microbial genetic characteristics, widespread use of antibiotic drugs and changes 
30 in society that enhance the transmission of drug-resistant organisms (for a review, 
see Cohen, 1992). This spread of drug resistant microbes is leading to ever- 
increasing morbidity, mortality and health-care costs. 

There are over 160 antibiotics currently available for treatment of microbial 
infections, all based on a few basic chemical structures and targeting a small number 
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of metabolic pathways: bacterial cell wall synthesis, protein synthesis, and DNA 
replication. Despite all these antibiotics, a person could succumb to an infection as a 
result of a resistant bacterial infection. Resistance now reaches all classes of 
antibiotics currently in use, including: p-lactams, fluoroquinolones, 
5 aminoglycosides, macrolide peptides, chloramphenicol, tetracyclines, rifampicin, 
folate inhibitors, glycopeptides, and mupirocin. There is thus a need for new 
antibiotics, and this need will not subside given the ability bacteria have to overcome 
each new agent synthesized. It is also likely that targeting new pathways will play 
an important role in discovery of these new antibiotics. In fact, a number of crucial 

10 cellular pathways, such as secretion, cell division, and many metabolic functions, 
remain untargeted today. 

Most major pharmaceutical companies have on-going drug discovery 
programs for novel anti-microbials. These are based on screens for small molecule 
inhibitors (e.g., natural products, bacterial culture media, libraries of small 

1 5 molecules, combinatorial chemistry) of crucial metabolic pathways of the micro- 
organism of interest. The screening process is largely for cytotoxic compounds and 
in most cases is not based on a known mechanism of action of the compounds. 
Classical drug screening programs are being exhausted and many of these 
pharmaceutical companies are looking towards rational drug design programs. 

20 Several small to mid-size biotechnology companies, as well as large pharmaceutical 
companies, have developed systematic high-throughput sequencing programs to 
decipher the genetic code of specific micro-organisms of interest. The goal is to 
identify, through sequencing, unique biochemical pathways or intermediates that are 
unique to the microorganism. Knowledge of the function of these bacterial genes, 

25 may form the rationale for a drug discovery program based on the mechanism of 
action of the identified enzymes/proteins. However, one of the most critical steps in 
this approach is the ascertainment that the identified proteins and biochemical 
pathways are 1) non-redundant and essential for bacterial survival, and 2) constitute 
suitable and accessible targets for drug discovery. These two issues are not easily 

30 addressed since to date, 18 prokaryotic genomes have been sequenced and 200 

sequenced genomes are expected by the year 2000. For a majority of the sequenced 
genomes, less than 50% of the open reading frames (ORFs) have been linked to a 
known function. Even with the genome of Escherichia coli (E. coli), the most 
extensively studied bacterium, less than two-thirds of the annotated protein coding 
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genes showed significant similarity to genes with ascribed functions (Rusterholtz 
and Pohlschroder, 1999). Thus considerable work must be undertaken to identify 
appropriate bacterial targets for drug screening. 



The present invention is based on the identification of, and demonstration that, 
specific DNA sequences of a bacteriophage, when introduced into a host bacterium 
can kill, or inhibit growth, of the host. Thus, these DNA sequences are anti- 
10 microbial agents. Information based on these DNA sequences can be utilized to 
develop peptide mimetics that can also function also as anti-microbials. The 
identification of the host bacterial proteins, targeted by the anti-microbial 
bacteriophage DNA sequences, can provide novel targets for drug design and 
compound screening. 

15 In this regard, the terms "inhibit", "inhibition", "inhibitory", and "inhibitor" all 

refer to a function of reducing a biological activity or function. Such reduction in 
activity or function can, for example, be in connection with a cellular component 
(e.g., an enzyme), or in connection with a cellular process (e.g., synthesis of a 
particular protein), or in connection with an overall process of a cell (e.g., cell 

20 growth). In reference to cell growth, the inhibitory effects may be bactericidal 

(killing of bacterial cells) or bacteriostatic (i.e., stopping or at least slowing bacterial 
cell growth). The latter slows or prevents cell growth such that fewer cells of the 
strain are produced relative to uninhibited cells over a given time period. From a 
molecular standpoint, such inhibition may equate with a reduction in the level of, or 

25 elimination of, the transcription and/or translation of a specific bacterial target(s), or 
reduction or elimination of activity of a particular target biomolecule. - 



antibacterial agents by identifying the bacterial target(s) of at least one inhibitory 
gene product, e.g., protein from ORFs 17, 19, 43, 102, 104, and 182 of 
30 bacteriophage 77 or a homologous product. Such identification allows the 
development of antibacterial agents active on such targets. Preferred embodiments 
for identifying such targets involve the identification of binding of target and phage 
ORF products to one another. The target molecule may be a bacterial protein or 
other bacterial biomolecule, e.g., a nucleotprotein, a nucleic acid, a lipid or lipid- 
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containing molecule, a nucleoside or nucleoside derivative, a polysaccharide or 
polysaccharide-containing molecule, or a peptidoglycan. The phage ORF products 
may be subportions of a larger ORF product that also binds the host target. 
Exemplary approaches are described below in the Detailed Description. 

Additionally, the invention provides methods for identifying targets for 
antibacterial agents by identifying homologs of a Staphylococcus aureus target of a 
bacteriophage ORFs 17, 19, 43, 102, 104, or 182 product. Such homologs may be 
utilized in the various aspects and embodiments described herein. 

The term "fragment" refers to a portion of a larger molecule or assembly. For 
proteins, the term "fragment" refers to a molecule which includes at least 5 
contiguous amino acids from the reference polypeptide or protein, preferably at least 
6, 8, 10, 12, 15, 20, 30, 50 or more contiguous amino acids. In connection with 
oligo- or polynucleotides, the term "fragment" refers to a molecule which includes at 
least 15 contiguous nucleotides from a reference polynucleotide, preferably at least 
18, 21, 24, 30, 36, 45, 60, 90, 150, or more contiguous nucleotides. Also in 
preferred embodiments, the fragment has a length in a range with the minimum as 
described above and a maximum which is no more than 90 % of the length (or 
contains that percent of the contiguous amino acids or nucleotides) of the larger 
molecule (e.g., of the specified ORF), in other embodiments, the upper limit is no 
more than 60, 70, or 80% of the length of the larger molecule. 

Stating that an agent or compound is "active on" a particular cellular target, 
such as the product of a particular gene, means that the target is an important part of 
a cellular pathway which includes that target and that the agent acts on that pathway. 
Thus, in some cases the agent may act on a component upstream or downstream of 
the stated target, including a regulator of that pathway or a component of that 
pathway. In general, an antibacterial agent is active on an essential cellular function, 
often on a product of an essential gene. 

By "essential", in connection with a gene or gene product, is meant that the 
host cannot survive without, or is significantly growth compromised, in the absence 
or depletion of functional product. An "essential gene" is thus one that encodes a 
product that is beneficial, or preferably necessary, for cellular growth in vitro in a 
medium appropriate for growth of a strain having a wild-type allele corresponding to 
the particular gene in question. Therefore, if an essential gene is inactivated or 
inhibited, that cell will grow significantly more slowly or even not at all. Preferably 
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growth of a strain in which such a gene has been inactivated will be less than 20%, 
more preferably less than 10%, most preferably less than 5% of the growth rate of 
the wild-type, or not at all, in the growth medium. Preferably, in the absence of 
activity provided by a product of the gene, the cell will not grow at all or will be 
5 non-viable, at least under culture conditions similar to normal in vivo growth 
conditions; For example, absence of the biological activity of certain enzymes 
involved in bacterial cell wall synthesis can result in the lysis of cells under normal 
osmotic conditions, even though protoplasts can be maintained under controlled 
osmotic conditions. Preferably, but not necessarily, if such a gene is inhibited, e.g., 

10 with an antibacterial agent or a phage product, the growth rate of the inhibited 
bacteria will be less than 50%, more preferably less than 30%, still more preferably 
less than 20%, and most preferably less than 10% of the growth rate of the 
uninhibited bacteria. As recognized by those skilled in the art, the degree of growth 
inhibition will generally depend on the concentration of the inhibitory agent. In the 

15 context of the invention, essential genes are generally the preferred targets of 
antimicrobial agents. Essential genes can encode target molecules directly or can 
encode a product involved in the production, modification, or maintenance of a 
target molecule. 

A "target" refers to a biomolecule that can be acted on by an exogenous agent, 
20 thereby modulating, preferably inhibiting, growth or viability of a cell. In most 
cases such a target will be a nucleic acid sequence or molecule, or a polypeptide or 
protein. However, other types of biomolecules can also be targets, e.g., membrane 
lipids and cell wall structural components. 

The term "bacterium" refers to a single bacterial strain, and includes a single 
25 cell, and a plurality or population of cells of that strain unless clearly indicated to the 
contrary. In reference to bacteria or bacteriophage, the term "strain" refers to 
bacteria or phage having a particular genetic content. The genetic content includes 
genomic content as well as recombinant vectors. Thus, for example, two otherwise 
identical bacterial cells would represent different strains if each contained a vector, 
30 e.g., a plasmid, with different phage ORF inserts. 

In the context of the phage nucleic acid sequences, e.g., gene sequences, of 
this invention, the terms "homolog" and "homologous" denote nucleotide sequences 
from different bacteria or phage strains or species or from other types of organisms 
that have significantly related nucleotide sequences, and consequently significantly 
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related encoded gene products, preferably having related function. Homologous 
gene sequences or coding sequences have at least 70% sequence identity (as defined 
by the maximal base match in a computer-generated alignment of two or more 
nucleic acid sequences) over at least one sequence window of 48 nucleotides, more 
5 preferably at least 80 or 85%, still more preferably at least 90%, and most preferably 
at least 95%. The polypeptide products of homologous genes have at least 35% 
amino acid sequence identity over at least one sequence window of 18 amino acid 
residues, more preferably at least 40%, still more preferably at least 50% or 60%, 
and most preferably at least 70%, 80%, or 90%. Preferably, the homologous gene 

10 product is also a functional homolog, meaning that the homolog will functionally 
complement one or more biological activities of the product being compared. For 
nucleotide or amino acid sequence comparisons where a homology is defined by a % 
sequence identity, the percentage is determined using BLAST programs ( with 
default parameters (Altschul et al., 1997, "Gapped BLAST and PSI-BLAST: a new 

15 generation of protein database search programs, Nucleic Acid Res. 25:3389-3402). 
Any of a variety of algorithms known in the art which provide comparable results 
can also be used, preferably using default parameters. Performance characteristics 
for three different algorithms in homology searching is described in Salamov et al., 
1999, "Combining sensitive database searches with multiple intermediates to detect 

20 distant homologues." Protein Eng. 12:95-100. Another exemplary program 
package is the GCG™ package from the University of Wisconsin. 

Homologs may also or in addition be characterized by the ability of two 
complementary nucleic acid strands to hybridize to each other under appropriately 
stringent conditions. Hybridizations are typically and preferably conducted with 



25 probe-length nucleic acid molecules, preferably 20-100 nucleotides in length. Those 
skilled in the art understand how to estimate and adjust the stringency of 
hybridization conditions such that sequences having at least a desired level of 
complementarity will stably hybridize, while those having lower complementarity 
will not. For examples of hybridization conditions and parameters, see, e.g.,. 

30 Maniatis, T. et al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring 
Harbor University Press, Cold Spring, N.Y.; Ausubel, F.M. et al. (1994) Current 
Protocols in Molecular Biology . John Wiley & Sons, Secaucus, NJ. Homologs and 
homologous gene sequences may thus be identified using any nucleic acid sequence 
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of interest, including the phage ORFs and bacterial target genes of the present 



invention. 



A typical hybridization, for example, utilizes, besides the labeled probe of 



interest, a salt solution such as 6X SSC (NaCl and Sodium Citrate base) to stabilize 
5 nucleic acid strand interaction, a mild detergent such as 0.5% SDS, together with 
other typical additives such as Denhardt's solution and salmon sperm DNA. The 
solution is added to the immobilized sequence to be probed and incubated at suitable 
temperatures to preferably permit specific binding while minimizing nonspecific 
binding. The temperature of the incubations and ensuing washes is critical to the 

10 success and clarity of the hybridization. Stringent conditions employ relatively 
higher temperatures, lower salt concentrations, and/or more detergent than do non- 
stringent conditions. Hybridization temperatures also depend on the length, 
complementarity level, and nature (i.e., "GC content") of the sequences to be tested. 
Typical stringent hybridizations and washes are conducted at temperatures of at least 

1 5 40°C, while lower stringency hybridizations and washes are typically conducted at 
37°C down to room temperature (~25°C). One of ordinary skill in the art is aware 
that these conditions may vary according to the parameters indicated above, and that 
certain additives such as formamide and dextran sulphate may also be added to 
affect the conditions. 

20 By "stringent hybridization conditions" is meant hybridization conditions at 

least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 
mM NaH 2 P0 4 , pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 
5X Denhart's solution at 42°C overnight; washing with 2X SSC, 0.1% SDS at 45°C; 
and washing with 0.2X SSC, 0.1% SDS at 45°C. 

25 Homologous nucleotide sequences will distinguishably hybridize with a 

reference sequence with up to three mismatches in ten (i.e., at least 70% base match 
in two sequences of equal length). Preferably, the allowable mismatch level is up to 
two mismatches in 10, or up to one mismatch in ten, more preferably up to one 
mismatch in twenty. (Those ratios can, of course, be applied to larger sequences.) 

30 Preferred embodiments involve identification of binding between ORF 

product and bacterial cellular component that include methods for distinguishing 
bound molecules, for example, affinity chromatography, immunoprecipitation, 
crosslinking, and/or genetic screen methods that permit proteimprotein interactions 
to be monitored. One of skill in the art is familiar with these techniques and common 
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materials utilized (see, e.g., Coligan, J. et al. (eds.) (1995) Current Protocols in 
Protein Science, John Wiley & Sons, Secaucus, N J.). 

Genetic screening for the identification of proteinrprotein interactions typically 
involves the co-introduction of both a chimeric bait nucleic acid sequence (here, the 
5 phage ORF to be tested) and a chimeric target nucleic acid sequence that, when co- 
expressed and having affinity for one another in a host cell, stimulate reporter gene 
expression to indicate the relationship. A "positive" can thus suggest a potential 
inhibitory effect in bacteria. This is discussed in further detail in the Detailed 
Description section below. In this way, new bacterial targets can be identified that 
10 are inhibited by specific phage ORF products or derivatives, fragments, mimetics, or 
other molecules. 

Other embodiments involve the identification and/or utilization of a target 
which is mutated at the site of phage 77 protein interaction but still functional in the 
cell by virtue of their host's relatively unresponsive nature in the presence of 
15 expression of ORFs previously identified as inhibitory to the non-mutant or wild- 
type strain. Such mutants have the effect of protecting the host from an inhibition 
that would otherwise occur (e.g., by competing for binding with the phage ORF 
a |;; product) and indirectly allow identification of the precise responsible target. The 

L identified target can then be used, e.g., for follow-up studies and anti-microbial 

*S 20 development. In certain embodiments, rescue from inhibition occurs under 
flj 

ff\ conditions in which a bacterial target or mutant target is highly expressed. This is 



Q 



^1 



performed, for example, through coupling of the sequence with regulatory element 
promoters, e.g., as known in the art, which regulate expression at levels higher than 
wild-type, e.g., at a level sufficiently higher that the inhibitor can be competitively 
25 bound to the highly expressed target such that the bacterium is detectably less 
inhibited. 

Identification of the bacterial target can involve identification of a phage- 
specific site of action. This can involve a newly identified target, or a target where 
the phage site of action differs from the site of action of a previously known 
30 antibacterial agent or inhibitor. For example, phage T7 genes 0.7 and 2.0 target the 
host RNA polymerase, which is also the cellular target for the antibacterial agent, 
rifampin. To the extent that a phage product is found to act at a different site than 
previously described inhibitors, aspects of the present invention can utilize those 
new, phage-specific sites for identification and use of new agents. The site of action 
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can be identified by techniques known to those skilled in the art, for example, by 
mutational analysis, binding competition analysis, and/or other appropriate 
techniques. 

Once a bacterial host target or mutant target sequence has been identified, it 
too can be conveniently sequenced, sequence analyzed (e.g., by computer), and the 
underlying gene(s), and corresponding translated product(s) further characterized. 
Preferred embodiments include such analysis and identification. Preferably such a 
target has not previously been identified as an appropriate target for antibacterial 
action. 

Also in preferred embodiments in which the bacterial target is a polypeptide or 
nucleic acid molecule, the identification of a bacterial target of a phage ORF product 
or fragment includes identification of a cellular and/or biochemical function of the 
bacterial target. As understood by those skilled in the art, this can, for example, 
include identification of function by identification of homologous polypeptides or 
nucleic acid molecules having known function, or identification of the presence of 
known motifs or sequences corresponding to known function. Such identifications 
can be readily performed using sequence comparison computer software, such as the 
BLAST programs and similar other programs and sequence and motif databases. 

In embodiments involving expression of a phage ORF in a bacterial strain, in 
preferred embodiments that expression is inducible. By "inducible" is meant that 
expression is absent or occurs at a low level until the occurrence of an appropriate 
environmental stimulus provides otherwise. For the present invention such 
induction is preferably controlled by an artificial environmental change, such as by 
contacting a bacterial strain population with an inducing compound (i.e., an 
inducer). However, induction could also occur, for example, in response to build-up 
of a compound produced by the bacteria in the bacterial culture, e.g., in the medium. 
As uncontrolled or constitutive expression of inhibitory ORFs can severely 
compromise bacteria to the point of eradication, such expression is therefore 
undesirable in many cases because it would prevent effective evaluation of the strain 
and inhibitor being studied. For example, such uncontrolled expression could 
prevent any growth of the strain following insertion of a recombinant ORF, thus 
preventing determination of effective transfection or transformation. A controlled or 
inducible expression is therefore advantageous and is generally provided through the 
provision of suitable regulatory elements, e.g., promoter/operator sequences that can 
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be conveniently transcriptionally linked to a coding sequence to be evaluated. In 
most cases, the vector will also contain sequences suitable for efficient replication of 
the vector in the same or different host cells and/or sequences allowing selection of 
cells containing the vector, i.e., "selectable markers." Further, preferred vectors 
include convenient primer sequences flanking the cloning region from which PCR 
and/or sequencing may be performed. In preferred embodiments where the 
purification of phage product is desired, preferably the bacterium or other cell type 
does not produce a target for the inhibitory product, or is otherwise resistant to the 
inhibitory product. 

In preferred embodiments, the target of the phage ORF product or fragment is 
identified from a bacterial animal pathogen, preferably a mammalian pathogen, more 
preferably a human pathogen, and is preferably a gene or gene product of such a 
pathogen. Also in preferred embodiments, the target is a gene or gene product, 
where the sequence of the target is homologous to a gene or gene product from such 
a pathogen as identified above. 

As used herein, the term "mammal" has its usual biological meaning, and 
particularly includes bovines, swine, dogs, cats, and humans. 

Other aspects of the invention provide isolated, purified, or enriched specific 
phage nucleic acid and amino acid sequences, subsequences, and homologs thereof 
from or corresponding to ORFs 17, 19, 43, 102, 104, and 182 from bacteriophage 77 
(Staphylococcus aureus host bacterium). Such nucleotide sequences are at least 15 
nucleotides in length, preferably at least 18, 21, 24, or 27 nucleotides in length, more 
preferably at least 30, 50, or 90 nucleotides in length. In certain embodiments, 
longer nucleic acids are preferred, for example those of at least 120, 150, 200, 300, 
600, 900 or more nucleotides. Such sequences can, for example, be amplification 
oligonucleotides (e.g., PCR primers), oligonucleotide probes, sequences encoding a 
portion or all of a phage-encoded protein, or a fragment or all of a phage-encoded 
protein. In preferred embodiments, the nucleic acid sequence or amino acid 
sequence contains a sequence which has a lower length as specified above, and an 
upper-length limit which is no more than 50, 60, 70, 80, or 90% of the length of the 
full-length ORF or ORF product. The upper-length limit can also be expressed in 
terms of the number of base pairs of the ORF (coding region). 

As it is recognized that alternate codons will encode the same amino acid for 
most amino acids due to the degeneracy of the genetic code, the sequences of this 
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aspect includes nucleic acid sequences utilizing such alternate codon usage for one 
or more codons of a coding sequence. For example, all four nucleic acid sequences 
GCT, GCC, GCA, and GCG encode the amino acid, alanine. Therefore, if for an 
amino acid there exists an average of three codons, a polypeptide of 100 amino acids 
5 in length will, on average, be encoded by 3 100 , or 5 x 10 47 , nucleic acid sequences. 
Thus, a nucleic acid sequence can be modified (e.g., a nucleic acid sequence from a 
phage as specified above) to form a second nucleic acid sequence encoding the same 
polypeptide as encoded by the first nucleic acid sequence using routine procedures 
and without undue experimentation. Thus, all possible nucleic acid sequences that 
10 encode the amino acid sequences encoded by the phage 77 ORFs 17, 19, 43, 102, 
104, and 182, as if all were written out in full, taking into account the codon usage, 
especially that preferred in the host bacterium. 



example, Stryer, BIOCHEMISTRY 3 rd ed., and Lehninger, BIOCHEMISTRY 3 rd ed. 

15 Codon preference tables for various types of organisms are available in the literature. 
Because of the number of sequence variations involving alternate codon usage, for 
the sake of brevity, individual sequences are not separately listed herein. Instead the 
alternate sequences are described by reference to the natural sequence with 
replacement of one or more (up to all) of the degenerate codons with alternate 

20 codons from the alternate codon table (Table 2), preferably with selection according 
to preferred codon usage for the normal host organism or a host organism in which a 
sequence is intended to be expressed. Those skilled in the art also understand how 
to alter the alternate codons to be used for expression in organisms where certain 
codons code differently than shown in the "universal" codon table. 

25 For amino acid sequences, sequences contain at least 5 peptide-linked amino 

acid residues, and preferably at least 6, 7, 10, 15, 20, 30, or 40, amino _acids having 
identical amino acid sequence as the same number of contiguous amino acid 
residues in a phage ORF 17, 19, 43, 102, 104, or 182 product. In some cases longer 
sequences may be preferred, for example, those of at least 50, 70, or 100 amino acids 

30 in length. In preferred embodiments, the sequence has bacteria-inhibiting function 
when expressed or otherwise present in a bacterial cell which is a host for the 
bacteriophage from which the sequence was derived. 

By "isolated" in reference to a nucleic acid is meant that a naturally occurring 
sequence has been removed from its normal cellular (e.g., chromosomal) 

SD-l 10719.1 



The alternate codon descriptions are available in common textbooks, for 



12 Patent 

241/190 



V.r 1 



environment or is synthesized in a non-natural environment {e.g., artificially 
synthesized). Thus, the sequence may be in a cell-free solution or placed in a 
different cellular environment. The term does not imply that the sequence is the 
only nucleotide chain present, but that it is essentially free (about 90-95% pure at 
5 least) of non-nucleotide material naturally associated with it, and thus is 
distinguished from isolated chromosomes. 

The term "enriched" means that the specific DNA or RNA sequence 
constitutes a significantly higher fraction (2-5 fold) of the total DNA or RNA 
present in the cells or solution of interest than in normal or diseased cells or in cells 
10 from which the sequence was originally taken. This could be caused by a person by 
preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
combination of the two. However, it should be noted that enriched does not imply 

that there are no other DNA or RNA sequences present, just that the relative amount 

y;i 

£ 15 of the sequence of interest has been significantly increased. 

5 1 The term "significant" is used to indicate that the level of increase is useful to 

C|i the person making such an increase and an increase relative to other nucleic acids of 

if": 

1;; about at least 2-fold, more preferably at least 5- to 10-fold or even more. The term 

]L also does not imply that there is no DNA or RNA from other sources. The other 

U-) 20 source DNA may, for example, comprise DNA from a yeast or bacterial genome, or 

E": : 

f!\ a cloning vector such as pUC 19. This term distinguishes from naturally occurring 

events, such as viral infection, or tumor type growths, in which the level of one 
mRNA may be naturally increased relative to other species of mRNA. That is, the 
term is meant to cover only those situations in which a person has intervened to 

25 elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that a nucleotide sequence be in 
purified form. The term "purified" in reference to nucleic acid does not require 
absolute purity (such as a homogeneous preparation). Instead, it represents an 
indication that the sequence is relatively more pure than in the natural environment 

30 (compared to the natural level, this level should be at least 2-5 fold greater, e.g., in 
terms of mg/mL). Individual clones isolated from a genomic or cDNA library may 
be purified to electrophoretic homogeneity. The claimed DNA molecules obtained 
from these clones could be obtained directly from total DNA or from total RNA. 
cDNA clones are not naturally occurring, but rather are preferably obtained via 
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manipulation of a partially purified naturally occurring substance (messenger RNA). 
The construction of a cDNA library from mRNA involves the creation of a synthetic 
substance (cDNA) and pure individual cDNA clones can be isolated from the 
synthetic library by clonal selection of the cells carrying the cDNA library. Thus, 
the process which includes the construction of a cDNA library from mRNA and 
isolation of distinct cDNA clones yields an approximately 10 6 -fold purification of 
the native message. Thus, purification of at least one order of magnitude, preferably 
two or three orders, and more preferably four or five orders of magnitude is 
expressly contemplated. A genomic library can be used in the same way and yields 
the same approximate levels of purification. 

The terms "isolated", "enriched", and "purified" with respect to the nucleic 
acids, above, may similarly be used to denote the relative purity and abundance of 
polypeptides (multimers of amino acids joined one to another by a-carboxyl:a- 
amino group (peptide) bonds). These, too, may be stored in, grown in, screened in, 
and selected from libraries using biochemical techniques familiar in the art. Such 
polypeptides may be natural, synthetic or chimeric and may be extracted using any 
of a variety of methods, such as antibody immunoprecipitation, other "tagging" 
techniques, conventional chromatography and/or electrophoretic methods. Some of 
the above utilize the corresponding nucleic acid sequence. 

As indicated above, aspects and embodiments of the invention are not limited 
to entire genes and proteins. The invention also provides and utilizes fragments and 
portions thereof, preferably those which are "active" in the inhibitory sense 
described above. Such peptides or oligopeptides and oligo or polynucleotides have 
preferred lengths as specified above for nucleic acid and amino acid sequences from 
phage; corresponding recombinant constructs can be made to express the encoded 
same. Also included are homologous sequences and fragments thereof. 

The nucleotide and amino acid sequences identified herein are believed to be 
correct, however, certain sequences may contain a small percentage of errors, e.g., 1- 
5%. In the event that any of the sequences have errors, the corrected sequences can 
be readily provided by one skilled in the art using routine methods. For example, 
the nucleotide sequences can be confirmed or corrected by obtaining and culturing 
the relevant phage, and purifying phage genomic nucleic acids. A region or regions 
of interest can be amplified, e.g., by PCR from the appropriate genomic template, 
using primers based on the described sequence. The amplified regions can then be 
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sequenced using any of the available methods (e.g. , a dideoxy termination method, 
for example, using commercially available products). This can be done redundantly 
to provide the corrected sequence or to confirm that the described sequence is 
correct. Alternatively, a particular sequence or sequences can be identified and 
5 isolated as an insert or inserts in a phage genomic library and isolated, amplified, 
and sequenced by standard methods. Confirmation or correction of a nucleotide 
sequence for a phage gene provides an amino acid sequence of the encoded product 
by merely reading off the amino acid sequence according to the normal codon 
relationships and/or expressed in a standard expression system and the polypeptide 
10 product sequenced by standard techniques. The sequences described herein thus 
provide unique identification of the corresponding genes and other sequences, 
allowing those sequences to be used in the various aspects of the present invention. 
Confirmation of a phage ORF encoded amino acid sequence can also be confirmed 
t» by constructing a recombinant vector from which the ORF can be expressed in. an 

J;: 1 5 appropriate host (e.g., E. coli), purified, and sequenced by conventional protein 

O 



sequencing methods. 

C8 In other aspects the invention provides recombinant vectors and cells 

O 

j* harboring phage 77 ORF 17, 19, 43, 102, 104, or 182 or portions thereof, or bacterial 

;L target sequences described herein, preferably where the phage or bacterial sequence 

v!;i 20 is inserted in a recombinant vector. As understood by those skilled in the art, 

fji 

f)\ vectors may assume different forms, including, for example, plasmids, cosmids, and 



s 

V:.- 



\!3 virus-based vectors. See, e.g., Maniatis, T. et al. (1989} Molecular Cloning: A 

Laboratory Manual. Cold Spring Harbor University Press, Cold Spring, N.Y.; See 
also, Ausubel, F.M. et al. (eds.) (1994) Current Protocols in Molecular Biology . 
25 John Wiley & Sons, Secaucus, N.J. 

In preferred embodiments, the vectors will be expression vectors, preferably 
shuttle vectors that permit cloning, replication, and expression within bacteria. An 
"expression vector" is one having regulatory nucleotide sequences containing 
transcriptional and translational regulatory information that controls expression of 
30 the nucleotide sequence in a host cell. Preferably the vector is constructed to allow 
amplification from vector sequences flanking an insert locus. In certain 
embodiments, the expression vectors may additionally or alternatively support 
expression, and/or replication in animal, plant and/or yeast cells due to the presence 
of suitable regulatory sequences, e.g., promoters, enhancers, 3' stabilizing 
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sequences, primer sequences, etc. In preferred embodiments, the promoters are 
inducible and specific for the system in which expression is desired, e.g., bacteria, 
animal, plant, or yeast. The vectors may optionally encode a "tag" sequence or 
sequences to facilitate protein purification or protein detection. Convenient 
restriction enzyme cloning sites and suitable selective markers) are also optionally 
included. Such selective markers can be, for example, antibiotic resistance markers 
or markers which supply an essential nutritive growth factor to an otherwise 
deficient mutant host, e.g., tryptophan, histidine, or leucine in the Yeast Two-Hybrid 
systems described below. 

The term "recombinant vector" relates to a single- or double-stranded circular 
nucleic acid molecule that can be transfected into cells and replicated within or 
independently of a cell genome. A circular double-stranded nucleic acid molecule 
can be cut and thereby linearized upon treatment with appropriate restriction 
enzymes. An assortment of nucleic acid vectors, restriction enzymes, and the 
knowledge of the nucleotide sequences cut by restriction enzymes are readily 
available to those skilled in the art. A nucleic acid molecule encoding a desired 
product can be inserted into a vector by cutting the vector with restriction enzymes 
and ligating the two pieces together. Preferably the vector is an expression vector, 
e.g., a shuttle expression vector as described above. 

By " recombinant cell" is meant a cell possessing introduced or engineered 
nucleic acid sequences, e.g., as described above. The sequence may be in the form 
of or part of a vector or may be integrated into the host cell genome. Preferably the 
cell is a bacterial cell. 

In preferred embodiments, the inserted nucleic acid sequence corresponding to 
at least a portion of a bacteriophage 77 ORF 17, 19, 43, 102, 104, and 182 gene 
product has a length as specified for the isolated purified or enriched nucleic acid 
sequences in an aspect above. 

In another aspect, the invention also provides methods for identifying and/or 
screening compounds "active on" at least one bacterial target of a bacteriophage 
inhibitor protein or RNA. Preferred embodiments involve contacting bacterial target 
proteins with a test compound, and determining whether the compound binds to or 
reduces the level of activity of the bacterial target, e.g., a bacterial protein. 
Preferably this is done in vivo under approximately physiological conditions. The 
compounds that can be used may be large or small, synthetic or natural, organic or 
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inorganic, proteinaceous or non-proteinaceous. In preferred embodiments, the 
compound is a peptidomimetic, as described herein, a bacteriophage inhibitor 
protein or fragment or derivative thereof, preferably an "active portion", or a small 
molecule. In particular embodiments, the methods include the identification of 
5 bacterial targets as described above or otherwise described herein. Preferably the 
fragment of a bacteriophage inhibitor protein includes less than 80% of an intact 
bacteriophage inhibitor protein. Preferably, the at least one target includes a 
plurality of different targets of bacteriophage inhibitor proteins, preferably a 
plurality of different targets. The plurality of targets can be in or from a plurality of 
10 different bacteria, but preferably is from a single bacterial species. 

In embodiments involving binding assays, preferably binding is to a fragment 
or portion of a bacterial target protein, where the fragment includes less than 90%, 
80%, 70%, 60%, 50%, 40%, or 30% of an intact bacterial target protein. Preferably, 
the at least one bacterial target includes a plurality of different targets of 
15 bacteriophage inhibitor proteins, preferably a plurality of different targets. The 
plurality of targets can be in or from a plurality of different bacteria, but preferably 
is from a single bacterial species. 

A "method of screening" refers to a method for evaluating a relevant activity 

5 or property of a large plurality of compounds, rather than just one or a few 

O 

j;i 20 compounds. For example, a method of screening can be used to conveniently test at 

rli 



O 

5*5"s 



least 100, more preferably at least 1000, still more preferably at least 10,000, and 



U;i most preferably at least 100,000 different compounds, or even more. 



In the context of this invention, the term "small molecule" refers to 
compounds having molecular mass of less than 3000 Daltons, preferably less than 

25 2000 or 1500, still more preferably less than 1000, and most preferably less than 600 
Daltons. Preferably but not necessarily, a small molecule is not an oligopeptide. 

In a related aspect or in preferred embodiments, the invention provides a 
method of screening for potential antibacterial agents by determining whether any of 
a plurality of compounds, preferably a plurality of small molecules, is active on at 

30 least one target of a bacteriophage inhibitor protein or RNA. Preferred embodiments 
include those described for the above aspect, including embodiments which involve 
determining whether one or more test compounds bind to or reduce the level of 
activity of a bacterial target, and embodiments which utilize a plurality of different 
targets as described above. 
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The identification of bacteria-inhibiting phage ORFs and their encoded 
products also provides a method for identifying an active portion of such an encoded 
product. This also provides a method for identifying a potential antibacterial agent 
by identifying such an active portion of a phage ORF or ORF product. In preferred 
embodiments, the identification of an active portion involves one or more of 
mutational analysis, deletion analysis, or analysis of fragments of such products. 
The method can also include determination of a 3-dimensional structure of an active 
portion, such as by analysis of crystal diffraction patterns. In further embodiments, 
the method involves constructing or synthesizing a peptidomimetic compound, 
where the structure of the peptidomimetic compound corresponds to the structure of 
the active portion. 

In this context, "corresponds" means that the peptidomimetic compound 
structure has sufficient similarities to the structure of the active portion that the 
peptidomimetic will interact with the same molecule as the phage protein and 
preferably will elicit at least one cellular response in common which relates to the 
inhibition of the cell by the phage protein. 

The methods for identifying or screening for compounds or agents active on a 
bacterial target of a phage-encoded inhibitor can also involve identification of a 
phage-specific site of action on the target. 

An "active portion" as used herein denotes an epitope, a catalytic or regulatory 
domain, or a fragment of a bacteriophage inhibitor protein that is responsible for, or 
a significant factor in, bacterial target inhibition. The active portion preferably may 
be removed from its contiguous sequences and, in isolation, still effect inhibition. 

By "mimetic" is meant a compound structurally and functionally related to a 
reference compound that can be natural, synthetic, or chimeric. In terms of the 
present invention, a "peptidomimetic," for example, is a compound that mimics the 
activity-related aspects of the 3-dimensional structure of a peptide or polypeptide in 
a non-peptide compound, for example mimics the structure of a peptide or active 
portion of a phage- or bacterial ORF-encoded polypeptide. 

A related aspect provides a method for inhibiting a bacterial cell by contacting 
the bacterial cell with a compound active on a bacterial target of a bacteriophage 
inhibitor protein or RNA encoded by bacteriophage 77 ORF 17, 19, 43, 102, 104, or 
182, where the target was uncharacterized. In preferred embodiments, the 
compound is such a protein, or a fragment or derivative thereof; a structural mimetic, 
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e.g., a peptidomimetic, of such a protein or fragment; a small molecule; the 
contacting is performed in vitro, the contacting is performed in vivo in an infected or 
at risk organism, e.g., an animal such as a mammal or bird, for example, a human, or 
other mammal described herein, or in a plant. 

In the context of this invention, the term "bacteriophage inhibitor protein" 
refers to a protein encoded by a bacteriophage nucleic acid sequence which inhibits 
bacterial function in a host bacterium. Thus, it is a bacteria-inhibiting phage 
product. 

In the context of this invention, the phrase "contacting the bacterial cell with a 
compound active on a bacterial target of a bacteriophage inhibitor protein" or 
equivalent phrases refer to contacting with an isolated, purified, or enriched 
compound or a composition including such a compound, but specifically does not 
rely on contacting the bacterial cell with an intact naturally occurring phage which 
encodes the compound. Preferably no intact phage are involved in the contacting. 

Related aspects provide methods for prophylactic or therapeutic treatment of a 
bacterial infection by administering to an infected, challenged or at risk organism a 
therapeutically or prophylactically effective amount of a compound active on a 
target of a bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182 product, e.g., as 
described for the previous aspect. Preferably the bacterium involved in the infection 
or risk of infection produces the identified target of the bacteriophage inhibitor 
protein or alternatively produces a homologous target compound. In preferred 
embodiments, the host organism is a plant or animal, preferably a mammal or bird, 
and more preferably, a human or other mammal described herein. Preferred 
embodiments include, without limitation, those as described for the preceding 
aspect. 

Compounds useful for the methods of inhibiting, methods of treating, and 
pharmaceutical compositions can include novel compounds, but can also include 
compounds which had previously been identified for a purpose other than inhibition 
of bacteria. Such compounds can be utilized as described and can be included in 
pharmaceutical compositions. 

By "treatment" or "treating" is meant administering a compound or 
pharmaceutical composition for prophylactic and/or therapeutic purposes. The term 
"prophylactic treatment" refers to treating a patient or animal that is not yet infected 
but is susceptible to or otherwise at risk of a bacterial infection. The term 
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"therapeutic treatment" refers to administering treatment to a patient already 
suffering from infection. 

The term "bacterial infection" refers to the invasion of the host organism, 
animal or plant, by pathogenic bacteria. This includes the excessive growth of 
5 bacteria which are normally present in or on the body of the organism, but more 
generally, a bacterial infection can be any situation in which the presence of a 
bacterial population(s) is damaging to a host organism. Thus, for example, an 
organism suffers from a bacterial infection when excessive numbers of a bacterial 
population are present in or on the organism's body, or when the effects of the 
10 presence of a bacterial population(s) is damaging to the cells, tissue, or organs of the 
organism. 

The terms "administer", "administering", and "administration" refer to a 
method of giving a dosage of a compound or composition, e.g., an antibacterial 
pharmaceutical composition, to an organism. Where the organism is a mammal, the 

15 method is, e.g., topical, oral, intravenous, transdermal, intraperitoneal, 

intramuscular, or intrathecal. The preferred method of administration can vary 
depending on various factors, e.g., the components of the pharmaceutical 
composition, the site of the potential or actual bacterial infection, the bacterium 
involved, and the infection severity. 

20 The term "mammal" has its usual biological meaning, referring to any 

organism of the Class Mammalia of higher vertebrates that nourish their young with 
milk secreted by mammary glands, e.g., mouse, rat, and, in particular, human, 
bovine, sheep, swine, dog, and cat. 

In the context of treating a bacterial infection a "therapeutically effective 

25 amount" or "pharmaceutically effective amount" indicates an amount of an 

antibacterial agent, e.g., as disclosed for this invention, which has a therapeutic 
effect. This generally refers to the inhibition, to some extent, of the normal cellular 
functioning of bacterial cells that renders or contributes to bacterial infection. 
The dose of antibacterial agent that is useful as a treatment is a 

30 "therapeutically effective amount." Thus, as used herein, a therapeutically effective 
amount means an amount of an antibacterial agent that produces the desired 
therapeutic effect as judged by clinical trial results and/or animal models. This 
amount can be routinely determined by one skilled in the art and will vary depending 
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on several factors, such as the particular bacterial strain involved and the particular 
antibacterial agent used. 

In connection with claims to methods of inhibiting bacteria and therapeutic or 
prophylactic treatments, "a compound active on a target of a bacteriophage inhibitor 
protein" or terms of equivalent meaning differ from administration of or contact with 
an intact phage naturally encoding the full-length inhibitor compound. While an 
intact phage may conceivably be incorporated in the present methods, the method at 
least includes the use of an active compound as specified different from a full length 
inhibitor protein naturally encoded by a bacteriophage and/or a delivery or 
contacting method different from administration of or contact with an intact phage 
naturally encoding the full-length protein. Similarly, pharmaceutical compositions 
described herein at least include an active compound or composition different from a 
phage naturally coding the full-length inhibitor protein, or such a full-length protein 
is provided in the composition in a form different from being encoded by an intact 
phage. Preferably the methods and compositions do not include an intact phage. 

In accordance with the above aspects, the invention also provides antibacterial 
agents and compounds active on a bacterial target of bacteriophage ORF 17, 19, 43, 
102, 104, or 182, where the target was uncharacterized as indicated above. As 
previously indicated, such active compounds include both novel compounds and 
compounds which had previously been identified for a purpose other than inhibition 
of bacteria. Such previously identified biologically active compounds can be used in 
embodiments of the above methods of inhibiting and treating. In preferred 
embodiments, the targets, bacteriophage, and active compound are as described 
herein for methods of inhibiting and methods of treating. Preferably the agent or 
compound is formulated in a pharmaceutical composition which includes a 
pharmaceutically acceptable carrier, excipient, or diluent. In addition, the invention 
provides agents, compounds, and pharmaceutical compositions where an active 
compound is active on an uncharacterized phage-specific site on the target. 

In preferred embodiments, the target is as described for embodiments of 
aspects above. 

Likewise, the invention provides a method of making an antibacterial agent. 
The method involves identifying a target of a bacteriophage 77 ORF 17, 19, 43, 102, 
104, or 182 product, screening a plurality of compounds to identify a compound 
active on the target, and synthesizing the compound in an amount sufficient to 
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provide a therapeutic effect when administered to an organism infected by a 
bacterium naturally producing the target. 

In preferred embodiments, the identification of the target and identification of 
active compounds include steps or methods and/or components as described above 
5 (or otherwise herein) for such identification. Likewise, the active compound can be 
as described above, including fragments and derivatives of phage inhibitor proteins, 
peptidomimetics, and small molecules. As recognized by those skilled in the art, 
peptides can be synthesized by expression systems and purified, or can be 
synthesized artificially by methods well known in the art. 
10 In the context of nucleic acid or amino acid sequences of this invention, the 

term "corresponding" indicates that the sequence is at least 95% identical, preferably 
at least 97% identical, and more preferably at least 99% identical to a sequence from 
the specified phage genome or bacterial genome, a ribonucleotide equivalent, a 
degenerate equivalent (utilizing one or more degenerate codons), or a homologous 
15 sequence, where the homolog provides functionally equivalent biological function. 

/Iiranbodimenis where tiie bacterial target or a bacteridphage^mhibiioP^RF — 
product, e.g., an inhibitory protein or polypeptide, the targeHs^referably encoded 
by a S. aureus nucleic acid coding sequence fropa-lfhost bacterium for bacteriophage 
77. Target sequences are described herein*by reference to sequence source sites. 
20 The sequence encoding the targ^^referably corresponds to a S. aureus nucleic acid 
sequence available from numerous sources including S. aureus sequences deposited 
in GenBank, S. awre^/sequences found in European Patent Application No. 
97100110.7 to IKunan Genome Sciences, Inc. filed January 7, 1997, 5. aureus 
sequences^availabie from TIGR at http://vAvw.ti^r.org/tdb/mdb/mdb.htmL and S. 
25 aurej^ysequences available from the Oklahoma University S. aureus sequencing 

r gcnomc.ou ^du/st aph ne wiitml. 
The amino acid sequence of a polypeptide target is readily provided by 
translating the corresponding coding region. For the sake of brevity, the sequences 
are not reproduced herein. Also, in preferred embodiments, a target sequence 
30 corresponds to a S. aureus coding sequences corresponding to a sequence listed in 
Table 6 herein. The listing in Table 6 describes S. aureus sequences currently 
deposited in GenBank. Again, for the sake of brevity, the sequences are described 
by reference to the GenBank entries instead of being written out in full herein. In 
cases where an entry for a coding region is not complete, the complete sequence can 
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be readily obtained by routine methods, e.g., by isolating a clone in a phage 77 host 
5. aureus genomic library, and sequencing the clone insert to provide the relevant 
coding region. The boundaries of the coding region can be identified by 
conventional sequence analysis and/or by expression in a bacterium in which the 
5 endogenous copy of the coding region has been inactivated and using subcloning to 
identify the functional start and strop codons for the coding region. 

As used in the claims to describe the various inventive aspects and 
embodiments, "comprising" means including, but not limited to, whatever follows 
10 the word "comprising". Thus, use of the term "comprising" indicates that the listed 
elements are required or mandatory, but that other elements are optional and may or 
may not be present. By "consisting of is meant including, and limited to, whatever 
follows the phrase "consisting of 1 . Thus, the phrase "consisting of 1 indicates that 
the listed elements are required or mandatory and that no other elements may be 
15 present. By "consisting essentially of* is meant including any elements listed after 
the phrase, and limited to other elements that do not interfere with or contribute to 
the activity or action specified in the disclosure for the listed elements. Thus, the 
J:: phrase "consisting essentially of 1 indicates that the listed elements are required or 

mandatory, but that other elements are optional and may or may not be present 
%!;S 20 depending upon whether or not they affect the activity or action of the listed 

a: x 
C : r 

elements. 



Additional features and embodiments of the present invention will be apparent 
from the following Detailed Description and from the claims, all within the scope of 
25 the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 



FIGURE 1 is a flow schematic showing the manipulations necessary to convert 
30 pT0021, an arsenite inducible vector containing the luciferase gene, into pTHA or 
pTM, two ars inducible vectors. Vector pTHA contains BamH I, Sal I, and Hind m 
cloning sites and a downstream HA epitope tag. Vector pTM contains Bam HI and 
Hind HI cloning sites and no HA epitope tag. 
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FIGURE 2 is a schematic representation of the cloning steps involved to place the 
DNA segments of any of ORFs 17/ 19/ 43/ 102/104/182 or other sequences into pTHA to 
assess inhibitory potential. For subcloning into pTM or pT0021, Individual ORFs were 
amplified by the PCR using oligonucleotides targeting the ATG and stop codons of the 
5 ORFs. Using this strategy, Bam HI and Hind m sites were positioned immediately 

upstream or downstream, respectively of the start and stop codons of each ORF. Following 
digestion with Bam HI and Hind HI, the PCR fragments were subcloned into the same sites 
of pT0021 or pTM. Clones were verified by PCR and direct sequencing. 

10 FIGURE 3 shows a schematic representation of the functional assays used to 

characterize the bactericidal and bacteriostatic potential of all predicted ORFs (>33 
amino acids) encoded by bacteriophage 77. Fig. 3A) Functional assay on semi-solid 
support media. Fig. 3B) Functional assay in liquid culture. 

f! 

vr.r 

\KI 1 5 FIGURE 4 shows the results of a screen to assess killing potential of bacteriophage 



20 containing a given ORF. The number on the grid refers to the ORF contained within 
a given Staph A. clone and the number following the dash identifies the different 
transformants. For example, 5-3 and 5-1 refers to ORF 5, transformant #3 and ORF 
5, transformant #1, respectively. A full darkened bar under the name of the Staph 
clone infers that an undiluted sample of the overnight culture was spotted onto the 



25 agar plate; whereas a shortened bar indicates that a 10-fold diluted sample of the 
overnight culture was spotted onto the agar plate. II) An overnight culture of the 
Staphylococcus aureus transformant was spotted onto directly, or after a 10-fold 
diluted, onto an agar plate lacking arsenite. Ill) II) An overnight culture of the 
Staphylococcus aureus transformant was spotted onto directly, or after a 10-fold 

30 diluted, onto an agar plate containing 5 |iM arsenite, to induce expression of the ars 
promoter and hence of the bacteriophage 77 ORF. From this experiment, it is clear 
that expression of bacteriophage ORF 19 (PanelA) and ORF 182 (Panel C) lead to 
inhibition of growth of Staphylococcus aureus. Positive controls for this experiment 



77 encoded ORFs (>33 amino acids) on semi-solid support media. For each ORF 
tested between one to three independent transformants were tested at two different 
concentrations. Each panel (A-H) consists of 3 sections: I) A grid showing the 
position of spotting of individual Staphylococcus aureus transformant clones 
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included expression of the lysis cassette from bacteriophage 77 (Panel H, Lys-1, 
Lys-2, Lys-3) and the lysis cassette from bacteriophage 77 (TH-1, TH-2, TH-3). 



3 



FIGURE 5 shows the results of a screen to assess killing potential of bacteriophage 
5 77 encoded ORFs (>33 amino acids) on semi-solid support media. In this 

experiment, clones of Staphylococcus aureus harboring a given bacteriophage 77 
ORF were streaked on plates lacking or containing 5 jiM arsenite. Following 
incubation of the plates at 37°C overnight, inhibition of growth was visually 
assessed. The identity of the expression vector and subcloned ORF harbored by the 
10 Staphylococcus transformant is given around the border of each plate. From this 
experiment, it is clear that expression of bacteriophage ORF 17 (Panel A), ORF 43 
(Panel B), ORF 102, (Panel B and C), and ORF 104 (Panel D) lead to inhibition of 
growth of Staphylococcus aureus. 



% 15 FIGURE 6 is a bar graph showing the results of a screen in liquid media to assess 
\J bacteriostatic or bactericidal activity of 93 predicted ORFs (>33 amino acids) 

encoded by bacteriophage 77. Growth inhibition assays were performed as detailed 
in the Detailed Description. The relative growth of Staphylococcus aureus 
transformants harboring a given bacteriophage 77 ORF (identified on the bottom of 
20 the graph), in the absence or presence of arsenite, is plotted relative to growth of a 
Staphylococcus aureus transformant containing ORF 5, a non-toxic bacteriophage 77 
ORF (which is set at 100%). Each bar represents the average obtained from three 
Staph A transformants grown in duplicate. Bacteriophage 77 ORFs showing 
significant growth inhibition are plotted in red and consist or ORF 17, 19, 102, 104, 
25 and 182. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Preliminarily the tables will be briefly described. 

Table 1 shows the complete nucleotide sequence of the genome of 
30 Staphylococcus aureus bacteriophage 77. 

Table 2 is a table from Alberts et al., MOLECULAR BIOLOGY OF THE 
CELL 3 rd ed., showing the redundancy of the "universal" genetic code. 

Table 3 FIGURE 3 shows the predicted nucleotide sequence, predicted 
amino acid sequence, and physiochemical parameters of ORF 17/19/ 43/ 102/ 104/ 
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182]. These include the primary amino acid sequence of the predicted protein, the average 
molecular weight, amino acid composition, theoretical pi, hydrophobicity map, and 
predicted secondary structure map. 

Table 4 FIGURE 4 Shows homology search results. BLAST analysis was 
5 performed with ORFs 17/ 19/ 43/ 102/ 104/ 182 against NCBI non-redundant 

nucleotide and Swissprot databases. The results of this search indicate that: I) ORF 
17 has no significant homology to any gene in the NCBI non-NCBI non-redundant 
nucleotide database, II) ORF 19 has significant homology to one gene in the NCBI 
non-redundant nucleotide database - the gene encoding ORF 59 of bacteriophage phi 
10 PVL, IE) ORF 43 has significant homology to one gene in the NCBI non-redundant 
nucleotide database - the gene encoding ORF 39 of phi PVL, IV) ORF 102 has 
significant homology to one gene in the NCBI non-redundant nucleotide database - 
the gene encoding ORF 38 of phi PVL, V) ORF 104 has no significant homology to 
p any gene in the NCBI non-redundant nucleotide database, VI) ORF 1 82 has 

1 5 significant homology to one gene in the NCBI non-redundant nucleotide database - 
Q the gene encoding ORF 39 of phi PVL. 

Table 5 FIGURE 7 shows a list of all the ORFs from Bacteriophage 77 that 
were screened in the functional assay to identify those with anti-microbial activity. 
Table 6 is a list of Staphylococcus aureus sequences which may represent 
20 genes coding for target sequences for the phage 77-encoded antimicrobial proteins or 
peptides. 

jfcFhgfoeseiil i nventi o n ib babod o n Hie identifi c ati o n of na tur ally oceufll gg^- 
DNA sequence elements encoding RNA or proteins with anti-nji^ftJBi^activity. 
Bacteriophages or phages, are viruses that infect andJdfloacteria. They are natural 
enemies of bacteria and, over the coursepf^volution have perfected enzymes 
(products of DNA sequences) wjiklienable them to infect a host bacteria, replicate 
their genetic material, usjjurfHiost metabolism, and ultimately kill their host. The 
scientific literaturp^documents well the fact that many known bacteria have a large 
30 number of sjnJn bacteriophages that can infect and kill them ( for example, see the 
ATCC bacteriophage collection at http://www.atcc.org) (Ackermann and DuBow, 
1987> Although we know that many bacteriophages encode proteins which can 
"il fcnifipnntly alter their hmt'n mtiiiHiliflmjTfHm nrm^ Irilling pntrntin 1 nfn 
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As indicated in the Summary above, the present invention is concerned with 



the use of bacteriophage 77 coding sequences and the encoded polypeptides or RNA 
5 transcripts to identify bacterial targets for potential new antibacterial agents. Thus, 
the invention concerns the selection of relevant bacteria. Particularly relevant 
bacteria are those which are pathogens of a complex organism such as an animal, 
e.g., mammals, reptiles, and birds, and plants. However, the invention can be 
applied to any bacterium (whether pathogenic or not) for which bacteriophage are 

10 available or which are found to have cellular components closely homologous to 
components targeted by phage 77 ORFs 17, 19, 43, 102, 104, and 182. 

Identification of ORFs 17, 19, 43, 102, 104, and 182 and products from the 
phage which inhibit the host bacterium both provides an inhibitor compound and 
allows identification of the bacterial target affected by the phage-encoded inhibitor. 

15 Such a target is thus identified as a potential target for development of other 
antibacterial agents or inhibitors and the use of those targets to inhibit those bacteria. 
As indicated above, even if such a target is not initially identified in a particular 
bacterium, such a target can still be identified if a homologous target is identified in 
another bacterium. Usually, but not necessarily, such another bacterium would be a 

20 genetically closely related bacterium. Indeed, in some cases, an inhibitor encoded 
by phage 77 ORF 17, 19, 43, 102, 104, or 182 can also inhibit such a homologous 
bacterial cellular component. 

The demonstration that bacteriophage have adapted to inhibiting a host 
bacterium by acting on a particular cellular component or target provides a strong 

25 indication that that component is an appropriate target for developing and using 
antibacterial agents, e.g., in therapeutic treatments. Thus, the present invention 
provides additional guidance over mere identification of bacterial essential genes, as 
the present invention also provides an indication of accessibility of the target to an 
inhibitor, and an indication that the target is sufficiently stable over time {e.g., not 

30 subject to high rates of mutation) as phage acting on that target were able to develop 
and persist. Thus, the present invention identifies a particular subset of essential 
cellular components which are particularly likely to be appropriate targets for 
development of antibacterial agents. 
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The invention also, therefore, concerns the development or identification of 
inhibitors of bacteria, in addition to the phage-encoded inhibitory proteins (or RNA 
transcripts), which are active on the targets of bacteriophage-encoded inhibitors. As 
described herein, such inhibitors can be of a variety of different types, but are 
5 preferably small molecules. 

The following description provides preferred methods for implementing the 
various aspects of the invention. However, as those skilled in the art will readily 
recognize, other approaches can be used to obtain and process relevant information. 
Thus, the invention is not limited to the specifically described methods. In addition, 

10 the following description provides a set of steps in a particular order. That series of 
steps describes the overall development involved in the present invention. However, 
it is clear that individual steps or portions of steps may be usefully practiced 
separately, and, further, that certain steps may be performed in a different order or 
even bypassed if appropriate information is already available or is provided by other 

15 sources or methods. 

Identification of Inhibitory ORF 

The methodology previously described in U.S. Provisional Patent 
Application NO. 60/1 10,992, and filed December 3, 1998, was used to to identify 

20 and characterize DNA sequences from Staphylococcus aureus bacteriophage 77 that 
can act as anti-microbials. A nucleic acid segment isolated from Staphylococcus 
aureus bacteriophage 77 encodes a protein, whose gene is referred to as ORF (open 
reading frame) 17, 19, 43, 102, 104, or 182. Thus, the present invention provides a 
nucleic acid sequence isolated from Staphylococcus aureus (Staph A or 5. aureus) 

25 bacteriophage 77 comprising at least a portion of the gene encoding ORF 17, 19, 43, 
102, 104, or 182 with anti-microbial activity. The nucleic acid sequence can be 
isolated using a method similar to those described herein, or using another method. 
In addition, such a nucleic acid sequence can be chemically synthesized. Having the 
anti-microbial nucleic acid sequence of the present invention, parts thereof or 

30 oligonucleotides derived therefrom, other anti-microbial sequences from other 
bacteriophage sources using methods described herein or other methods can be 
isolated, including screening methods based on nucleic acid sequence hybridization. 

The present invention provides the use of the Staph A bacteriophage 77 anti- 
microbial DNA segment encoding ORF 17, 19, 43, 102, 104, or 182], as a 

35 pharmacological agent - either wholly or in part - as well as the use of 

SD-110719.1 



28 




Patent 
241/190 



peptidomimetics, developed from amino acid or nucleotide sequence knowledge of 
Staph A bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182. This can be achieved 
where the structure of the peptidomimetic compound corresponds to the structure of 
the active portion of ORF 17, 19, 43, 102, 104, or 182. In this analysis, the peptide 
backbone is transformed into a carbon-based hydrophobic structure that can retain 
cytostatic or cytocidal activity for the bacterium. This is done by standard medicinal 
chemistry methods, measuring growth inhibition of the various molecules in liquid 
cultures or on solid medium. These mimetics also represent lead compounds for the 
development of novel antibiotics. 

In this context, "corresponds" means that the peptidomimetic compound 
structure has sufficient similarities to the structure of the active portion of ORF 17, 
19, 43, 102, 104, or 182 that the peptidomimetic will interact with the same 
molecule as the product of ORF 17, 19, 43, 102, 104, or 182 and preferably will 
elicit at least one cellular response in common which relates to the inhibition of the 
cell by the phage protein. 

The invention also provides bacteriophage anti-microbial DNA segments 
from other phages based on nucleic acids and sequences hybridizing to the presently 
identified inhibitory ORF under high stringency conditions or sequences which are 
homologous as described above. The bacteriophage anti-microbial DNA segment 
from bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182 can be used to identify a 
related segment from another related or unrelated phage based on conditions of 
hybridization or sequence comparison. 

Identification of Bacterial Targets 

The present invention provides the use of Staphylococcus bacteriophage 77 
ORFs 17, 19, 43, 102, 104, and 182 anti-microbial activity to identify essential host 
bacterium interacting proteins or other targets that could, in turn, be used for drug 
design and/or screening of test compounds. Thus, the invention provides a method of 
screening for antibacterial agents by determining whether test compounds interact 
with (e.g., bind to) the bacterial target. The invention also provides a method of 
making an antibacterial agent based on production and purification of the protein or 
RNA product of bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182. The method 
involves identifying a bacterial target of the product of ORF 17, 19, 43, 102, 104, or 
182, screening a plurality of compounds to identify a compound active on the target, 
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and synthesizing the compound in an amount sufficient to provide a therapeutic 
effect when administered to an organism infected by a bacterium naturally producing 
the target. The rationale is that the product of ORFs 17, 19, 43, 102, 104, and 182 
can physically interact and/or modify certain microbial host components to block 
5 their function. 

A variety of methods are known to those skilled in the art for identifying 
interacting molecules and for identifying target cellular components. Several 
approaches and techniques are described below which can be used to identify the 
host bacterial pathway and protein that interact or are inhibited by ORF 17, 19, 43, 
10 102, 104, or 182. 

The first approach is a genetic screen for protein:protein interaction, e.g., 
either some form of two hybrid screen or some form of suppressor screen. In one 
form of the two hybrid screen involving the yeast two hybrid system, the nucleic 
acid segment encoding ORF 17, 19, 43, 102, 104, or 182], or a portion thereof, is 
15 fused to the carboxyl terminus of the yeast Gal4 activation domain II (amino acids 
768-881) to create a bait vector. A cDNA library of cloned S. aureus sequences 
CSi which have been engineered into a plasmid where the S. aureus sequences are fused 

j« to the DNA binding domain of Gal4 is also generated. These plasmids are 

" introduced alone, or in combination, into a yeast strain, e.g., Y190, previously 

±q 20 engineered with chromosomally integrated copies of the E. coli lacZ and the 

a: : 

selectable His3 genes, both under Gal4 regulation (Durfee et al., 1993). If the two 
WiJ proteins expressed in yeast interact, the resulting complex will activate transcription 

from promoters containing Gal4 binding sites. A lacZ and His3 gene, each driven by 
a promoter containing Gal4 binding sites, have been integrated into the genome of 
25 the host yeast system and are used for measuring protein-protein interactions. Such a 
system provides a physiological environment in which to detect potential protein 
interactions. 

This system has been extensively used to identify novel protein-protein 
interaction partners and to map the sites required for interaction (for example, to 
30 identify interacting partners of translation factors (Qui et al., 1998), transcription 
factors (Katagiri et al., 1998), proteins involved in signal transduction (Endo et al., 
1997). Alternatively, a bacterial two-hybrid screen can be utilized to circumvent the 
need for the interacting proteins to be targeted to the nucleus, as is the case in the 
yeast system (Karimova et al., 1998). 

SD-1 10719.1 



30 Patent 

241/190 



The protein targets of ORFs 17, 19, 43, 102, 104, and 182 can also be 
identified using bacterial genetic screens. One approach involves the overexpression 
of ORF 17, 19, 43, 102, 104, or 182 protein in mutagenized S. aureus followed by 
plating the cells and searching for colonies that can survive the anti-microbial 
5 activity of ORF 17, 19, 43, 102, 104, or 182. These colonies are then grown, their 
DNA extracted, and cloned into an expression vector that contains a replicon of a 
different incompatibility group from the plasmid expressing ORF 17, 19, 43, 102, 
104, or 182. This library is then introduced into a wild-type Staph A bacterium in 
conjunction with an expression vector driving synthesis of ORF 17, 19, 43, 102, 
10 104, or 182, followed by selection for surviving bacteria. Thus, Staph A DNA 

fragments from the survivors presumably contain a DNA fragment from the original 
mutagenized Staph A genome that can protect the cell from the antimicrobial activity 
of ORF 17, 19, 43, 102, 104, or 182. This fragment can be sequenced and compared 
w with that of the bacterial host to determine in which gene the mutation lies. This 

4= 1 5 approach enables one to determine the targets and pathways that are affected by the 
killing function. 

tO Alternatively, the bacterial targets can be determined in the absence of 

j;; selecting for mutations using the approach known as "multicopy suppression 11 . In 

jL this approach, the DNA from the wild type Staph A host is cloned into an expression 

\§ 20 vector that can coexist with the one containing ORF 17, 19, 43, 102, 104, or 182. 
pi Those plasmids that contain host DNA fragments and genes which protect the host 

*9 from the anti microbial activity of ORF 17, 19, 43, 102, 104, or 182 can then be 

isolated and sequenced to identify putative targets and pathways in the host bacteria. 
Another approach is based on identifying proteimprotein interactions 
25 between the product of ORF 17, 19, 43, 102, 104, or 182 and S. aureus host proteins, 
using a biochemical approach based on affinity chromatography. This approach has 
been used to identify interactions between lambda phage proteins and proteins from 
their E. coli host (Sopta et al., 1995). The product of ORF 17, 19, 43, 102, 104, or 
182 is fused to a tag (e.g. -glutathione-S-transferase) after insertion in a 
30 commercially available plasmid vector which directs high-level expression after 

induction of the responsive promoter driving the fusion protein. The fusion protein is 
expressed in E. coli, purified, and immobilized on a solid phase matrix. Total cell 
extracts from 5. aureus are then passed through the affinity matrix containing the 
immobilized phage ORF fusion protein; host proteins retained on the column are 
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then eluted under different conditions of ionic strength, pH, and detergents and 
identified by gel electrophoresis. They are recovered from the gel by transfer to a 
high affinity membrane. The proteins are individually digested to completion with a 
protease (e.g.-trypsin) and either molecular mass or the amino acid sequence of the 
tryptic fragments can be determined by mass spectrometry using MALDI-TOF 
technology (Qin et al., 1997). The sequence of the individual peptides from a single 
protein are then analyzed by a bioinformatics approach to identify the 5. aureus 
protein interacting with the phage ORF. This is performed by a computer search of 
the S. aureus genome for the identified sequence. Alternatively, tryptic peptide 
fragments of the S. aureus genome can be predicted by computer software based on 
the nucleotide sequence of the genome, and the predicted molecular mass of peptide 
fragments generated in silico compared to the molecular mass of the peptides 
obtained from each interacting protein eluted from the affinity matrix. 

In addition, an oligonucleotide cocktail can be synthesized based on the 
primary amino acid sequence determined for an interacting S. aureus protein 
fragment. This oligonucleotide cocktail would comprise a mixture of 
oligonucleotides based on the nucleotide sequences of the primary amino acid of the 
predicted peptide, but in which all possible codons for a particular amino acid 
sequence are present in a subset of the oligonucleotide pool. This cocktail can then 
be used as a degenerate probe set to screen, by hybridization to genomic or cDNA 
libraries, to isolate the corresponding gene. 

Alternatively, antibodies raised to peptides which correspond to an 
interacting S. aureus protein fragment can be used to screen expression libraries 
(genomic or cDNA) to identify the gene encoding the interacting protein. 



The invention also provides vectors, preferably expression vectors, harboring 
the anti-microbial DNA nucleic acid segment of the invention in an expressible 
form, and cells transformed with the same. Such cells can serve a variety of 
purposes, such as in vitro models for the function of the anti-microbial nucleic acid 
segment and screening for downstream targets of the anti-microbial nucleic acid 
segment, as well as expression to provide relatively large quantities of the inhibitory 
product. 



Vectors 
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Thus, an expression vector harboring the anti-microbial nucleic acid segment 
or parts thereof {Staph A bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182) can 
also be used to obtain substantially pure protein. Well-known vectors, such as the 
pGEX series (available from Pharmacia), can be used to obtain large amounts of the 
5 protein which can then be purified by standard biochemical methods based on 

charge, molecular mass, solubility, or affinity selection of the protein by using gene 
fusion techniques (such as GST fusion, which permits the purification of the protein 
of interest on a glutathione column). Other types of purification methods or fusion 
proteins could also be used as recognized by those skilled in the art. 
10 Likewise, vectors containing bacteriophage 77 ORFs 17, 19, 43, 102, 104, 

and 182 can be used in methods for identifying targets of the encoded antibacterial 
ORF product, e.g., as described above, and/or for testing inhibition of homologous 
bacterial targets or other potential targets in bacterial species other than 
O Staphylococcus aureus. 

£ 15 

O 

\A 

•s 
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Antibodies 

Antibodies, both polyclonal and monoclonal, can be prepared against the 
protein encoded by a bacteriophage anti-microbial DNA segment of the invention 

L (e.g., Staph A bacteriophage 77 ORF 17, 19, 43, 102, 104, or 182) by methods well 

O 

y;| 20 known in the art. Protein for preparation of such antibodies can be prepared by 
purification, usually from a recombinant cell expressing the specified ORF or 

ViJ fragment thereof. Those skilled in the art are familiar with methods for preparing 

polyclonal or monoclonal antibodies (See, e.g., Antibodies: A Laboratory Manual. 
Harlow and Lane, Cold Spring Harbor Laboratory, CSHL Press, N.Y., 1988). 
25 Such antibodies can be used for a variety of purposes including affinity 

purification of the protein encoded by the bacteriophage anti-microbial DNA 
segment, tethering of the protein encoded by the bacteriophage anti-microbial DNA 
segment to a solid matrix for purposes of identifying interacting host bacterium 
proteins, and for monitoring of expression of the protein encoded by the 
30 bacteriophage anti-microbial DNA segment. 



Recombinant Cells 

Bacterial cells containing an inducible vector regulating expression of the 
bacteriophage anti-microbial DNA segment can be used to generate an animal model 
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system for the study of infection by the host bacterium. The functional activity of the 
proteins encoded by the bacteriophage anti-microbial DNA segments, whether 
native or mutated, can be tested in animal in vitro or in vivo models. 

While such cells containing inducible expression vectors is preferred, other 
5 recombinant cells containing a recombinant phage 77 ORF 17, 19, 43, 102, 104, or 
182 sequence or portion thereof are also provided by the present invention. 

Also, a recombinant cell may contain a recombinant sequence encoding at 
least a portion of a protein which is a target of phage 77 ORF 17, 19, 43, 102, 104, 
or 182 inhibitory ORF product. 
10 In the context of this invention, in connection with nucleic acid sequences, 

the term "recombinant" refers to nucleic acid sequences which have been placed in a 
genetic location by intervention using molecular biology techniques, and does not 
include the relocation of phage sequences during or as a result of phage infection of 
¥«) a bacterium or normal genetic exchange processes such as bacterial conjugation. 

15 

Derivatization of identified anti-microbials 



ru 



In cases where the identified anti-microbials above are peptidic compounds, 
the in vivo effectiveness of such compounds may be advantageously enhanced by 

20 chemical modification using the natural polypeptide as a starting point and 
incorporating changes that provide advantages for use, for example, increased 
stability to proteolytic degradation, reduced antigenicity, improved tissue 
penetration, and/or improved delivery characteristics. 

In addition to active modifications and derivative creations, it can also be 

25 useful to provide inactive modifications or derivatives for use as negative controls or 
introduction of immunologic tolerance. For example, a biologically inactive 
derivative which has essentially the same epitopes as the corresponding natural 
antimicrobial can be used to induce immunological tolerance in a patient being 
treated. The induction of tolerance can then allow uninterrupted treatment with the 

30 active anti-microbial to continue for a significantly longer period of time. 

Modified anti-microbial polypeptides and derivatives can be produced using 
a number of different types of modifications to the amino acid chain. Many such 
methods are known to those skilled in the art. The changes can include, for 
example, reduction of the size of the molecule, and/or the modification of the amino 
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acid sequence of the molecule. In addition, a variety of different chemical 
modifications of the naturally occurring polypeptide can be used, either with or 
without modifications to the amino acid sequence or size of the molecule. Such 
chemical modifications can, for example, include the incorporation of modified or 
non-natural amino acids or non-amino acid moieties during synthesis of the peptide 
chain, or the post-synthesis modification of incorporated chain moieties. 

The oligopeptides of this invention can be synthesized chemically or through 
an appropriate gene expression system. Synthetic peptides can include both 
naturally occurring amino acids and laboratory synthesized, modified amino acids. 

Also provided herein are functional derivatives of anti-microbial proteins or 
polypeptides. By "functional derivative" is meant a "chemical derivative," 
"fragment," "variant," "chimera," or "hybrid" of the polypeptide or protein, which 
terms are defined below. A functional derivative retains at least a portion of the 
function of the protein, for example, reactivity with a specific antibody, enzymatic 
activity or binding activity. 

A "chemical derivative" of the complex contains additional chemical 
moieties not normally a part of the protein or peptide. Such moieties may improve 
the molecule's solubility, absorption, biological half-life, and the like. The moieties 
may alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, and the like. Moieties capable of mediating 
such effects are disclosed in Alfonso and Gennaro (1995). Procedures for coupling 
such moieties to a molecule are well known in the art. Covalent modifications of 
the protein or peptides are included within the scope of this invention. Such 
modifications may be introduced into the molecule by reacting targeted amino acid 
residues of the peptide with an organic derivatizing agent that is capable of reacting 
with selected side chains or terminal residues, as described below. 

Cysteinyl residues most commonly are reacted with alpha-haloacetates (and 
corresponding amines), such as chloroacetic acid or chloroacetamide, to give 
carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl residues also are 
derivatized by reaction with bromotrifluoroacetone, chloroacetyl phosphate, N- 
alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p-chloro- 
mercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3- 
diazole. 
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Histidyl residues are derivatized by reaction with diethylprocarbonate at pH 
5.5-7.0 because this agent is relatively specific for the histidyl side chain. Para- 
bromophenacyl bromide also is useful; the reaction is preferably performed in 0.1 M 
sodium cacodylate at pH 6.0. 
5 Lysinyl and amino terminal residues are reacted with succinic or other 

carboxylic acid anhydrides. Derivatization with these agents has the effect of 
reversing the charge of the lysinyl residues. Other suitable reagents for derivatizing 
primary amine- containing residues include imidoesters such as methyl 
picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; 
10 trinitrobenzenesulfonic acid; O-methylisourea; 2,4 pentanedione; and transaminase- 
catalyzed reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional 
reagents, among them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and 
ninhydrin. Derivatization of arginine residues requires that the reaction be 
15 performed in alkaline conditions because of the high pK^ of the guanidine functional 
group. Furthermore, these reagents may react with the groups of lysine as well as 

18 the arginine alpha-amino group. 

£3 

jz Tyrosyl residues are well-known targets of modification for introduction of 

* spectral labels by reaction with aromatic diazonium compounds or 

y!i 20 tetranitromethane. Most commonly, N-acetylimidizol and tetranitromethane are 



used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by 
reaction carbodiimide (R'-N-C-N-R 1 ) such as l-cyclohexyl-3-(2-morpholinyl(4- 
ethyl) carbodiimide or l-ethyi-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. 
25 Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and 
glutaminyi residues by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues are 
deamidated under mildly acidic conditions. Either form of these residues falls 
30 within the scope of this invention. 

Derivatization with Afunctional agents is useful, for example, for cross- 
linking component peptides to each other or the complex to a water-insoluble 
support matrix or to other macromolecular carriers. Commonly used cross-linking 
agents include, for example, 1,1 -bis (diazoacetyl)-2-phenylethane, glutaraldehyde, 
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N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), and bifiinctional maleimides such as bis-N- 
maleimido-l,8-octane. Derivatizing agents such as methyl-3-[p-azidophenyl) 
5 dithiolpropioimidate yield photoactivatable intermediates that are capable of forming 
crosslinks in the presence of light. Alternatively, reactive water-insoluble matrices 
such as cyanogen bromide-activated carbohydrates and the reactive substrates 
described in U.S. Patent Nos. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 
4,229,537; and 4,330,440 are employed for protein immobilization. 

10 Other modifications include hydroxylation of proline and lysine, 

phosphorylation of hydroxy 1 groups of seryl or threonyl residues, methylation of the 
alpha-amino groups of lysine, arginine, and histidine side chains (Creighton, T.E., 
Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)), acetylation of the N-terminal amine, and, in some instances, 

1 5 amidation of the C-terminal carboxyl groups. 

Such derivatized moieties may improve the stability, solubility, absorption, 
biological half-life, and the like. The moieties may alternatively eliminate or 
attenuate any undesirable side effect of the protein complex. Moieties capable of 
mediating such effects are disclosed, for example, in Alfonso and Gennaro (1995). 

20 The term "fragment" is used to indicate a polypeptide derived from the 

amino acid sequence of the protein or polypeptide having a length less than the full- 
length polypeptide from which it has been derived. Such a fragment may, for 
example, be produced by proteolytic cleavage of the full-length protein. Preferably, 
the fragment is obtained recombinantly by appropriately modifying the DNA 

25 sequence encoding the proteins to delete one or more amino acids at one or more 
sites of the C-terminus, N-terminus, and/or within the native sequence. 

Another functional derivative intended to be within the scope of the present 
invention is a "variant" polypeptide which either lacks one or more amino acids or 
contains additional or substituted amino acids relative to the native polypeptide. The 

30 variant may be derived from a naturally occurring polypeptide by appropriately 
modifying the protein DNA coding sequence to add, remove, and/or to modify 
codons for one or more amino acids at one or more sites of the C-terminus, N- 
terminus, and/or within the native sequence. 
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A functional derivative of a protein or polypeptide with deleted, inserted 
and/or substituted amino acid residues may be prepared using standard techniques 
well-known to those of ordinary skill in the art. For example, the modified 
components of the functional derivatives may be produced using site-directed 
5 mutagenesis techniques (as exemplified by Adelman et aL, 1983, DNA 2:183; 
Sambrook et al., 1989) wherein nucleotides in the DNA coding sequence are 
modified such that a modified coding sequence is produced, and thereafter 
expressing this recombinant DNA in a prokaryotic or eukaryotic host cell, using 
techniques such as those described above. Alternatively, components of functional 
10 derivatives of complexes with amino acid deletions, insertions and/or substitutions 
may be conveniently prepared by direct chemical synthesis, using methods well- 
known in the art. 

p Insofar as other anti-microbial inhibitor compounds identified by the 

v3 invention described herein may not be peptidal in nature, other chemical techniques 

p 1 5 exist to allow their suitable modification, as well, and according the desirable 
principles discussed above. 

G 



3 

O 20 



Administration and Pharmaceutical Compositions 

For the therapeutic and prophylactic treatment of infection, the preferred 



method of preparation or administration of anti-microbial compounds will generally 
vary depending on the precise identity and nature of the anti-microbial being 
delivered. Thus, those skilled in the art will understand that administration methods 

25 known in the art will also be appropriate for the compounds of this invention. 

Pharmaceutical compositions are prepared, as understood by those skilled in the art, 
to be appropriate for therapeutic use. Thus, generally the components and 
composition are prepared to be sterile and free of components or contaminants 
which would pose an unacceptable risk to a patient. For compositions to be 

30 administered internally is is generally important that the composition be pyrogen 
free, for example. 

The particularly desired anti-microbial can be administered to a patient either 
by itself, or.in pharmaceutical compositions where it is mixed with suitable carriers 
or excipient(s). In treating an infection, a therapeutically effective amount of an 
35 agent or agents is administered. A therapeutically effective dose refers to that 
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amount of the compound that results in amelioration of one or more symptoms of 
bacterial infection and/or a prolongation of patient survival or patient comfort. 

Toxicity, therapeutic and prophylactic efficacy of anti-microbials can be 
determined by standard pharmaceutical procedures in cell cultures and/or 
5 experimental organisms such as animals, e.g. t for determining the LD 50 (the dose 
lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 
50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD 5O /ED 50 . Compounds which 
exhibit large therapeutic indices are preferred. The data obtained from these cell 
10 culture assays and animal studies can be used in formulating a range of dosage for 
use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage 
may vary within this range depending upon the dosage form employed and the route 

*f: of administration utilized. 

4= 

O 1 5 For any compound identified and used in the method of the invention, the 

\ I 

- 1: therapeutically effective dose can be estimated initially from cell culture assays, 

til Such information can be used to more accurately determine useful doses in 

3 ; organisms such as plants and animals, preferably mammals, and most preferably 

"■if humans. Levels in plasma may be measured, for example, by HPLC or other means 

fij 20 appropriate for detection of the particular compound. 

*J;( The exact formulation, route of administration and dosage can be chosen by 

'4) the individual physician in view of the patient's condition (see e.g. Fingl et. al., in 

The Pharmacological Basis of Therapeutics . 1975, Ch. 1 p.l). 

It should be noted that the attending physician would know how and when to 
25 terminate, interrupt, or adjust administration due to toxicity, organ dysfunction, or 
other systemic malady. Conversely, the attending physician would also know to 
adjust treatment to higher levels if the clinical response were not adequate 
(precluding toxicity). The magnitude of an administered dose in the management of 
the disorder of interest will vary with the severity of the condition to be treated and 
30 the route of administration. The severity of the condition may, for example, be 

evaluated, in part, by standard prognostic evaluation methods. Further, the dose and 
perhaps dose frequency, will also vary according to the age, body weight, and 
response of the individual patient. A program comparable to that discussed above 
also may be used in veterinary or phyto medicine. 

SD-110719.1 



39 ^ Patent 

241/190 



Depending on the specific infection target being treated and the method 
selected, such agents may be formulated and administered systemically or locally, 
i.e., topically. Techniques for formulation and administration may be found in 
Alfonso and Gennaro (1995). Suitable routes may include , for example, oral, rectal, 
5 transdermal, vaginal, transmucosal, intestinal, parenteral, intramuscular, 

subcutaneous, or intramedullary injections, as well as intrathecal, intravenous, or 
intraperitoneal injections. 

For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
10 Ringer's solution, or physiological saline buffer. For transmucosal administration, 
penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 
p Use of pharmaceutical^ acceptable carriers to formulate identified anti- 

\l;f microbials of the present invention into dosages suitable for systemic administration 

O 15 is within the scope of the invention. With proper choice of carrier and suitable 

manufacturing practice, the compositions of the present invention, in particular those 
formulated as solutions, may be administered parenterally, such as by intravenous 
injection. Appropriate compounds can be formulated readily using pharmaceutical^ 
acceptable carriers well known in the art into dosages suitable for oral 
20 administration. Such carriers enable the compounds of the invention to be 

formulated as tablets, pills, capsules, liquids, gels, syrups, slurries, suspensions and 
%!3 the like, for oral ingestion by a patient to be treated. 

Agents intended to be administered intracellularly may be administered using 
techniques well known to those of ordinary skill in the art. For example, such agents 
25 may be encapsulated into liposomes, then administered as described above. 

Liposomes are spherical lipid bilayers with aqueous interiors. All molecules present 
in an aqueous solution at the time of liposome formation are incorporated into the 
aqueous interior. The liposomal contents are both protected from the external 
microenvironment and, because liposomes fuse with cell membranes, are efficiently 
30 delivered into the cell cytoplasm. Additionally, due to their hydrophobicity, small 
organic molecules may be directly administered intracellularly. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
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achieve the intended purpose. Determination of the effective amounts is well within 
the capability of those skilled in the art. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically acceptable carriers comprising excipients and 
5 auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutically. The preparations formulated for oral 
administration may be in the form of tablets, dragees, capsules, or solutions, 
including those formulated for delayed release or only to be released when the 
pharmaceutical reaches the small or large intestine. 
1 0 The pharmaceutical compositions of the present invention may be 

manufactured in a manner that is itself known, e.g., by means of conventional 
mixing, dissolving, granulating, dragee-making, levitating, emulsifying, 
P encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical formulations for parenteral administration include aqueous 



n 1 5 solutions of the active anti-microbial compounds in water-soluble form. 



a 

n 



m 



Alternatively, suspensions of the active compounds may be prepared as appropriate 
oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, 
or liposomes. Aqueous injection suspensions may contain substances which 
ry 20 increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, 

sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers 
or agents which increase the solubility of the compounds to allow for the preparation 
of highly concentrated solutions. 

Pharmaceutical preparations for oral use can be obtained by combining the 
25 active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to 
obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as 
sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such 
as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
30 tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 
disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, or alginic acid or a salt thereof such as sodium alginate. 



SD-110719.1 



rrl 



41 Patent 

241/190 



Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 
dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
5 Dyestuffs or pigments may be added to the tablets or dragee coatings for 

identification or to characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit 
capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active 

10 ingredients in admixture with filler such as lactose, binders such as starches, and/or 
lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, 
such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. 

15 The above methodologies may be employed either actively or 

prophylactically against an infection of interest. 



To identify DNA segments of Staph A bacteriophage 77 capable of acting as 
anti-microbial agents, a strategy described in a existing provisional patent was 
20 employed (U.S. Provisional Application No. 60/1 10,992, and filed Dec. 3, 1998). In 
essence, the procedure involved sequence characterization of the bacteriophage, 
& identification of protein coding regions (open reading frames or ORFs), subcloning 

of all ORFs into an appropriate inducible expression vector, transfer of the ORF 
subclones into Staph. A, followed by induction of ORF expression and assessment of 
25 effect on growth. We employed discovery steps as described in the Examples. 

Example I. Growth of Staph A bacteriophage 77 and purification of genomic 
DNA. 

The Staphylococcus aureus propagating strain (PS 77; ATCC #27699) was 
30 used as a host to propagate its respective phage 77 (ATCC # 27699-B1). Two rounds 
of plaque purification of phage 77 were performed on soft agar essentially as 
described in Sambrook et al (1989). Briefly, the PS 77 strain was grown overnight at 
37°C in Nutrient broth [NB: 0.3% Bacto beef extract, 0.5% Bacto peptone (Difco 
Laboratories) and 0.5% NaCl (w/v)].The culture was then diluted 20x in NB and 
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incubated at 37°C until the OD^ .2 (early log phase) with constant agitation. In 
order to obtain single plaques, phage 77 was subjected to 10- fold serial dilutions 
using phage buffer (1 mM MgS0 4 , 5 mM MgCl 2 , 80 mM NaCl and 0.1% Gelatin 
(w/v)) and 10 jil of each dilution was used to infect 0.5 ml of the cell suspension in 
the presence of 400 ng/ml CaCl 2 . After incubation of 15 min at room temperature 
(RT), 2 ml of melted soft agar kept at 45°C (NB supplemented with 0.6% agar) was 
added to the mixture and poured onto the surface of 100 mm nutrient agar plates 
(0.3% Bacto Beef extract, 0.5% Bacto peptone, 0.5% NaCl and 1.5% Bacto agar 
(w/v)). After overnight incubation at 30°C, a single plaque was isolated, resuspended 
in 1 ml of phage buffer by end over end rotation for 2 hrs at 20°C, and the phage 
suspension was diluted and used for a second infection as described above. After 
overnight incubation at 30°C, a single plaque was isolated and used as a stock. 

The propagation procedure for bacteriophage 77 was modified from the agar 
layer method of Swanstorm and Adams (1951). Briefly, the PS 77 strain was grown 
to stationary phase overnight at 37°C in Nutrient broth. The culture was then diluted 
twenty-fold in NB and incubated at 37°C until the OD 540 = .2. The suspension 
(15xl0 7 Bacteria) was then mixed with 15xl0 5 plaque forming units (pfu) to give a 
ratio of 100-bacteria/phage particle in the presence of 400 |ig/ml of CaCl 2 . After 
incubation for 15 min at 20°C, 7.5 ml of melted soft agar (NB plus 0.6% agar) were 
added to the mixture and poured onto the surface of 150 mm nutrient agar plates and 
incubated 16 hrs at 30°C To collect the phage plate lysate, 20 ml of NB were added 
to each plate and the soft agar layer was collected by scrapping off with a clean 
microscope slide followed by shaking of the agar suspension for 5 min to break up 
the agar. The mixture was then centrifiiged for 10 min at 4,000 RPM (2,830xg) in a 
JA-10 rotpr (Beckman) and the supernatant fluid (lysate) was collected and 
subjected to a treatment with 10 jig /ml of DNase I and RNase A for 30 min at 37°C. 
To precipitate the phage particles, the phage suspension was adjusted to 10% (w/v) 
PEG 8000 and 0.5 M of NaCl followed by incubation at 4°C for 16 hrs. The phage 
was recovered by centrifugation at 4,000 rpm (3,500xg) for 20 min at 4°C on a GS- 
6R table top centrifuge (Beckman). The pellet was resuspended with 2 ml of phage 
buffer (1 mM MgS0 4 , 5 mM MgCl 2 , 80 mM NaCl and 0.1% Gelatin). The phage 
suspension was extracted with 1 volume of chloroform and further purified by 
centrifugation on a cesium chloride step gradient as described in Sambrook et al. 
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(1989), using a TLS 55 rotor centrifixged in an Optima TLX ultracentrifuge 
(Beckman) for 2 h at 28,000 rpm (67,000xg) at 4°C. Banded phage was collected 
and ultracentrifuged again on an isopycnic cesium chloride gradient (1.45 g/ml) at 
40,000 rpm (64,000xg) for 24 h at 4°C using a TLV rotor (Beckman). The phage 
5 was harvested and dialyzed for 4 h at room temperature against 4 L of dialysis 
buffer consisting of 10 mM NaCl, 50 mM Tris-HCl [pH 8] and 10 mM MgCl 2 . 
Phage DNA was prepared from the phage suspension by adding 20 mM EDTA, 50 
mg/ml Proteinase K and 0.5% SDS and incubating for 1 h at 65°C, followed by 
successive extractions with 1 volume of phenol, 1 volume of phenol-chloroform and 
10 1 volume of chloroform. The DNA was then dialyzed overnight at 4°C against 4 L of 
TE (10 mM Tris pH 8.0, ImM EDTA). 

Example II. DNA sequencing of Bacteriophage 77 genome 

Four micrograms of phage DNA was diluted in 200 |il of TE (10 mM Tris, 

15 [pH 8.0], 1 mM EDTA) in a 1.5 ml eppendorf tube and sonication was performed 
(550 Sonic Dismembrator™, Fisher Scientific). Samples were sonicated under an 
amplitude of 3 jam with bursts of 5 s spaced by 15 s cooling in ice/water for 3 to 4 
cycles. The sonicated DNA was then size fractionated by electrophoresis on 1% 
agarose gels utilizing TAE (1 x TAE is: 40 mM Tris-acetate, 1 mM EDTA [pH 8.0]) 

20 as the running buffer. Fractions ranging from 1 to 2 kbp were excised from the 
agarose gel and purified using a commercial DNA extraction system according to 
the instructions of the manufacturer (Qiagen), with a final elution of 50 jil of 1 mM 
Tris (pH 8.5). 

The ends of the sonicated DNA fragments were repaired with a combination 
25 of T4 DNA polymerase and the Klenow fragment of E. coli DNA polymerase I, as 
follows. Reactions were performed in a reaction mixture (final volume, 1 00 |il) 
containing sonicated phage DNA, 10 mM Tris-HCl [pH 8.0], 50 mM NaCl, 10 mM 
MgCl 2 , 1 mM DTT, 50 ng/ml BSA, 100 jiM of each dNTP and 15 units of T4 DNA 
polymerase (New England Biolabs) for 20 min at 12°C followed by addition of 12.5 
30 units of Klenow large fragment (New England Biolabs) for 15 min at room 

temperature. The reaction was stopped by two phenol/chloroform extractions and the 
DNA was precipitated with ethanol and the final DNA pellet was resuspended in 20 
^ilofH 2 0. 
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Blunt-ended DNA fragments were cloned by ligation directly into the Hinc U 
site of pKSII+ vector (New England Biolabs) dephosphorylated by treatment with 
calf intestinal alkaline phosphatase (New England Biolabs)-treated pKS n+ vector 
(Stratagene). A typical ligation reaction contained 100 ng of vector DNA, 2 to 5 \l\ 
of repaired sonicated phage DNA (50-100 ng) in a final volume of 20 jil containing 
800 units of T4 DNA ligase (New England Biolabs) and was incubated overnight at 
16°C. Transformation and selection of bacterial clones containing recombinant 
plasmids was performed in E. coli DHlOp according to standard procedures 
(Sambrooket al., 1989). 

Recombinant clones were picked from agar plates into 96-well plates 
containing 100 |al LB and 100 ^ig/ml ampicillin and incubated at 37°C. The presence 
of phage DNA insert was confirmed by PCR amplification using T3 and T7 primers 
flanking the Hinc II cloning site of the pKS 11+ vector. PCR amplification of foreign 
insert was performed in a 15 \x\ reaction volume containing 10 mM Tris (pH 8.3), 50 
mM KC1, 1.5 mM MgCl 2 , 0.02% gelatin, 1 jiM primer, 187.5 yM each dNTP, and 
0.75 units Taq polymerase (BRL). The thermocycling parameters were as follows: 2 
min initial denaturation at 94°C for 2 min, followed by 20 cycles of 30 sec 
denaturation at 94°C, 30 sec annealing at 57°C, and 2 min extension at 72°C, 
followed by a single extension step at 72°C for 10 min. Clones with insert sizes of 1 
to 2 kbp were selected and plasmid DNA was prepared from the selected clones 
using QIAprep™ spin miniprep kit (Qiagen). 

The nucleotide sequence of the extremities of each recombinant clone was 
determined using an ABI 377-36 automated sequencer with two types of chemistry: 
ABI prism Big Dye™ primer or ABI prism Big Dye™ terminator cycle sequencing 
ready reaction kit (Applied Biosystems). To ensure co-linearity of the sequence data 
and the genome, all regions of phage genome were sequenced at least once from 
both directions on two separate clones. In areas that this criteria was not initially 
met, a sequencing primer was selected and phage DNA was used directly as 
sequencing template employing ABI priSm Big Dye™ terminator cycle sequencing 
ready reaction kit. 

Example III. Bioinformatic management of primary nucleotide sequence. 
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Sequence contigs were assembled using Sequencher™ 3.1 software 
(GeneCodes). To close contig gaps, sequencing primers were selected near the edge 
of the contigs. Phage DNA was used directly as sequencing template employing 
ABI prism BIG DYE™ terminator cycle sequencing ready reaction kit. The 
complete sequence of bacteriophage 77 is shown in Table 1. 

(a sallwaie program waa developed and treed on - the asoomblod ac qngg 




bactenophage 77 to identify all putative ORFs larger than 33£Dderts7 Other ORF 
identification software can also be utilized, prefera|jly^rograms which allow 
alternative start codons. The software scajjs^tlie primary nucleotide sequence 
starting at nucleotide #1 for an apppefJriate start codon. Three possible selections can 
be made for defining the n^ttfreof the start codon; I) selection of ATG, II) selection 
of ATG or GTG, arjd^rfl) selection of either ATG, GTG, TTG, CTG, ATT, ATC, 
and ATA. Tljklatter initiation codon set corresponds to the one reported by the 
NCBI n^://www.ncbi.nlm.nih.gov/htbin-post/Taxonomv/wprintgc?mode=c > ) for 
4hG4 m ot oi ii i hgg r ^ 



When an appropriate start codon is encountered, a counting mechanism is 
employed to count the number of codons (groups of three nucleotides) between this 
start codon and the next stop codon downstream of it. If a threshold value of 33 is 
reached, or exceeded, then the sequence encompassed by these two codons (start and 
stop codons) is defined as an ORF. This procedure is repeated, each time starting at 
the next nucleotide following the previous stop codon found, in order to identify all 
the other putative ORFs. The scan is performed on all three reading frames of both 
DNA strands of the phage sequence. 

using an implementation of BLAST programs, although any of a vaciet^of different 
sequence comparison and matching programs can be utilized^as known to those 
skilled in the art. Downloaded public databases usedTor sequence analysis include: 

i) non-redundant GenBank (ftp://ncbi.nlip^rilirgov/blast/db/nr.Z), 

ii) Swissprot (ftp://ncbi.nlm.nih^pv/Wast/db/swissprot.Z); 

iii) vector (ftp://ncbi.nlm^Mtf^o 

iv) pdbaa databasgs^np://ncbi.nlm.nih.gov/blast/db/pdbaa.Z); 

v) staphylpe6ccus aureus NCTC 8325 (ftp://ftp.genome.ou.edu/pub/staph/staph- 

-sfreptocQccufr jt y ugeiiea (flp.//flp.geilOitie.op.cdu/pub/5Ucp/atAcp lk.fe ); 
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(ftp://ftp.tigr.0rg/pub/data/s jneumoniae/gspxontigsJJ^J 
viii) mycobacterium tuberculosis CSU#9. 

(ftp://ftp.tigr.org/pu^/datS7r^ and ix) 

pseudomgnas*aeruginosa 



ie.washington.edu/pseudo?^^S5i3jr 
The results of the homology searches performed on the ORFs is shown in 



Table 4. 
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Example IV. Subcloning of Bacteriophage 77 ORFs into a Staph A inducible 
expression system. 



ctSTp l uuiT, in which the firefly luciferase (lucFF) exppe^Ton 
is controlled by the ars (arsenite) promoter/operator (Tauriainen ^^\991), was 
modified in the following fashion. Two oligonucleotides^orfesponding to a short 
antigenic peptide derived from the heamaglutinin ipKffein of influenza virus (HA 
epitope tag) were synthesized (Field et al., 19#6). The sense strand HA tag sequence 
(with BamHl, Sail and HindHl clonmg^ites) is: 

5 '-gatcccggtcgaccaagcttTACCpXTACGACGTCCCAGACTACGCC AGCTGA-3 * 
(where upper case letters detfiote the nucletotide sequence of the HA tag); the 
antisense strand HA tag sequence (with a Hindd cloning site) is: 
5 '-agctTCAGCTgrGCGTAGTCTGGGACGTCGTATGGGTAaagcttggtcgaccgg-3 * 
(where upperx^ase letters denote the sequence of the HA tag). The two HA tag 
oligonucleotides were annealed and ligated into pT0021 vector which had been 
digested tfith BamHl and HindSL. This manipulation resulted in replacement of the 
lucEF gene by the HA tag. This modified shuttle vector containing the arsenite 
isfaucible promoter, the arsR gene, and HA tag was named pTHA. A diagram 
iutfe^glbur modiricatKttr'Pf pT0021 to generate pi HA j?shownltriTgr4^ 

^jga gh ORF , ene oded4>ySai: le i iuphage 77, lar gerttifflr33-annn£ 
having a Shine-Dalgarno sequence upstream of the nutiatforTcodon was selected for 
functional analysis for bacterial inhibitioji^-Iirtotal, 98 ORFs were selected and 
screened as detailed below. AJistUr these is presented in Table 5. Each individual 
ORF, from initiationpodonto last codon (excluding the stop codon), was amplified 
from phage^gen^mic DNA using the polymerase chain reaction (PCR). For PCR 
sm^ pfr^at W^^ initiation codon-andu^> 
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-pr eccd o d bv a Jflni/ilwsliiclioii site f Y c ggg atcc 3S ) an d each anti cc nsc oligonu cleo t ide 
targets the pentultimate codon (the one before the stop codon) of the ORF^ndls 
preceded by a Sal I restriction site (^gcgtcgaccg 3 ). The PCR projiucfof each ORF 
was gel purified and digested with BamHl and Sail. Th^di^ested PCR product was 

5 then gel purified using the Qiagen kit as describ^d^figated into BamHl and Sail 
digested pTHA vector, and used to transfonff^. coli bacterial strain DH10P(as 
described above). As a result of this mmiipulation, the HA tag is set inframe with the 
ORF and is positioned at the cartx$xy terminus of each ORF (pTHA/ORF clones). 
Recombinant pTHA/ORF ckrfies were picked and their insert sizes were confirmed 

10 by PCR analysis using primers flanking the cloning site. The names and sequences 
of the primers that wdre used for the PCR amplification were: HAF: 
5 TATTATCCAAAACTTGAACA r ; HAR: 5 'CGGTGGTATATCCAGTGATT 3 '. 
The sequenp^integrity of cloned ORFs was verified directly by DNA sequencing 
using primers HAF and HAR. In cases where verification of ORF sequence could 



15 notjbe achieved by one pass with the sequencing primers, additional internal primers 



fe^& lcctcd and used for -seq ii e ncinfo 
r ; i Staphylococcus aureus strain RN4220 (Kreiswirth et al, 1983) was used as a 

H = recipient for the expression of recombinant plasmids. Electoporation was performed 

□ essentially as previously described (Schenk and Laddaga, 1992). Selection of 

J: { 20 recombinant clones was performed on Luria-Broth agar (LB-agar) plates containing 
30 ng/ml of kanamycin. 

For each ORF introduced in the pTHA plasmid, 3 independent transformants 
were isolated and used to individually inoculate cultures in 5 ml of TSB containing 
30jag/ml kanamycin, followed by growth to saturation (16 hrs at 30°C). An aliquot 
25 of this stationary phase culture was used to generate a frozen glycerol stock of the 
transformant ( stored at - 80°C). The remaining culture was used for plasmid DNA 
extraction. Bacterial cells were harvested by centrifugation at 3000 x g at 22°C for 5 
min. The pellet was resuspended in 200 |il 25% sucrose containing 25U/ml of 
lysostaphin and incubated for 15 min at 37°C. Then, 400^1 of alkaline SDS solution 
30 (3% SDS, 0.2N NaOH) were, added, well mixed and incubated for 7 min at room 
temperature. After the alkaline SDS treatment, 300|il of ice-cold 3M sodium acetate 
pH 4.8 were added, and the mix is immediately spun at 13000g for 1 5 min at room 
temperature. The supernatant was transferred to a new 1 .5 ml conical centrifuge tube 
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and 650^1 of isopropanol (stored at room temperature) were added. The mix was 
then centrifuged at 13,000 x g for 5 min. The supernatant fluid was discarded, the 
pellet washed with 70% ethanol, and resuspended in 320 |al sterile distilled water. 
The presence of individual phage 77 ORF DNA inserts in the plasmid was 
5 verified by PCR amplification using 1.5 \i\ transfonnant miniprep DNA in a PCR 
with primers flanking the cloning site of ORF in pTHA vector (HAF and HAR). The 
composition of the PCR reaction and the cycling parameters are identical to those 
employed for library screening described above. 

10 Example V. Functional assay for bacterial inhibitory activity of bacteriophage 77 



The anti-microbial activity of individual phage 77 ORFs was monitored by 
two growth inhibitory assays, one on solid agar medium, the other in liquid medium. 
In general, Staphylococcus bacteria transformed with expression plasmids 
15 containing individual ORFs were grown in normal TSA medium and stored in 19% 
* glycerol. At pre-determined times, arsenite was added to the culture to induce 
transcription of the phage 77 ORFs cloned immediately downstream from an 
arsenite-inducible promoter in the pTHA expression plasmid. 



20 monitored and quantitated. The growth inhibition assay on solid medium was 

performed by streaking pTHA/ORF containing 5. aureus transformant onto LB-Kn 
and TSA-Kn plates containing increasing concentrations of sodium arsenite (0; 2.5; 
5; and 7.5 \iM). Arsenite is used Jo induce the expression of cloned DNA in pTHA 
vector. In parallel, 3 |il of 1/10 and 1/100 dilutions of the frozen cultures of the 

25 pTHA/ORF transformants were spotted as single drops onto LB-Kn and TSA-Kn 
plates containing increasing concentration of sodium arsenite (0; 2.5; 5; and 7.5 
HM). The plates were then incubated 16 hrs at 37°C, and the effect of arsenite- 
induced ORF expression on bacterial growth was monitored and quantitated by 
comparing the extent to that seen in control plates. As positive controls for growth 

30 inhibition,the holin/lysin genes of the Staphylococcus aureus phage Twort 

(Loessher et al., 1998) was subcloned into the pTHA ars inducible vector and used. 

For the growth inhibition assay in liquid medium, stationary phase cultures 
were prepared by inoculating 2.5ml TSB-Kn with frozen S. aureus RN4220 



ORFs. 



The effect of ORF induction on bacterial growth characteristics was then 
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transformants containing phage 77 ORFs cloned in pTHA vector followed by 
incubation for 16 hrs at 37°C. These cultures were then diluted 1/100 in the same 
medium, and the bacteria were allowed to grow for 2 hrs at 37°C to reach early log 
phase. 1 50 \x\ of such culture were then mixed with 2.35 ml TSB-Kn medium with 
5 or without arsenite (the final concentration of arsenite in the medium was 0 or 5 |iM 
arsenite). After 3.5 hrs incubation at 37°C with shaking at 250 rpm, 100 \i\ of 
bacterial culture was removed from each tube for OD 565 measurement. Serial ten- 
fold dilutions of the culture in buffered saline solution (0.85% NaCl) were then 
spotted onto TSB-Kn plates. The plates were incubated at 37°C 16 hrs and the 
10 number of surviving colonies counted the following day. The growth inhibitory 

property of individual ORFs was then quantitated by comparing CFU numbers under 
normal or arsenite-induction conditions. Inhibition results are shown in Figures 5 
and 5. 
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All patents and publications mentioned in the specification are indicative of 
the levels of skill of those skilled in the art to which the invention pertains. All 
references cited in this disclosure are incorporated by reference to the same extent as 
if each reference had been incorporated by reference in its entirety individually. 
15 One skilled in the art would readily appreciate that the present invention is 

well adapted to carry out the objects and obtain the ends and advantages mentioned, 
as well as those inherent therein. The specific methods and compositions described 
herein as presently representative of preferred embodiments are exemplary and are 
not intended as limitations on the scope of the invention. Changes therein and other 
vli 20 uses will occur to those skilled in the art which are encompassed within the spirit of 
1 :: the invention are defined by the scope of the claims. 

Ill 

It will be readily apparent to one skilled in the art that varying substitutions 
and modifications may be made to the invention disclosed herein without departing 
from the scope and spirit of the invention. For example, those skilled in the art will 
25 recognize that the invention may suitably be practiced using a variety of different 
expression vectors and sequencing methods within the general descriptions 
provided. 

The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not 
30 specifically disclosed herein. Thus, for example, in each instance herein any of the 
terms "comprising," "consisting essentially of and "consisting of may be replaced 
with either of the other two terms. The terms and expressions which have been 
employed are used as terms of description and not of limitation, and there is not . 
intention that in the use of such terms and expressions of excluding any equivalents 
35 of the features shown and described or portions thereof, but it is recognized that 

various modifications are possible within the scope of the invention claimed. Thus, 
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it should be understood that although the present invention has been specifically 
disclosed by preferred embodiments and optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the 
art, and that such modifications and variations are considered to be within the scope 

5 of this invention as defined by the appended claims. 

In addition, where features or aspects of the invention are described in terms 
of Markush groups or other grouping of alternatives, those skilled in the art will 
recognize that the invention is also thereby described in terms of any individual 
member or subgroup of members of the Markush group or other group. For 

10 example, if there are alternatives A, B, and C, all of the following possibilities are 
included: A separately, B separately, C separately, A and B, A and C, B and C, and 
A and B and C. 

Thus, additional embodiments are within the scope of the invention and 
within the following claims. 
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